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PHARMACEUTICAL FROIEWS, HUMAN THERAPEPTICS. HUMAN SERUM 
Ar.HTrMTO^ msuUN, NATIVE CHOLERA TOXIC B SUBMITTED ON 
TRANSGENICS PLASTIDS 

5 

RELATED APPLICATIONS 
This patent application claims the benefit of U.S. Proyisional Applicadon No. 
60/185^87, fBed March 1, 2000, 60/263,473, filed January 23, 2001, 60/263,668 filed 
January 23, 2001, 60^3,424, filed January 23, 2001 and flie U.S. Provisional 
10 Application for Hemy Daniell ^ititled Expression of tiie Native Cholera To)dn B 
Suburdt Gene as Oligomers in Transgenic Tobacco Chloroplasts," filed February 23, 
2001. lliis patent ^Hcadon is also related to patent pubHcadonPCT^ 
99/10513 Specific Aims; mtemational publicatfon date 4, March 1990. These earlier 
provisional applications and pubfications are hereby incorporated by xcSsrmx. 

15 

TECHNICAL FIELD 

This invention relates to conq)ositions and methods and products of 
Phannaceutical Proteins, Human TlieF^eutics, Human Serum Albumin, Insulin, Native 
Chol^ Toxic B Submitted On Transgenics Plastids, containing transformed plastids. 

20 

This invraition relates to several embodiments which are disclosed horein in 
several specifications and correq>onding figures titled as Pharmaceutical Proteins, 
Human Therapeutics, Human Senim Albumin, Jbzsulin, Native Oiolera Toxic B 
Submitted On Transgenics Plastids presented as one patent plication and a set of clairDS 
25 diereo£ 

NON^BVIOUS NATURE OF INVENTION 

Despite ^pctcnlial advantages of ddoroplasts for biophamiaceatical production, it was 
30 not obvious that oqHessedpharmaceutical proteins in plastids would assemble in this organelle. 
Prior to this patent application there were no published reports of biopharmaceutical proteins 
expression in chloroplasts. Indeed, there weze valid reasons to suggest that such e3q>re5siQn 
would be problemati& Ptoinsohn contains both a and pdiains and 1heC-pq>tide1ha^ 
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them. It is synthesized as a pre-proinsulin by the panci^^ After proper folding disulfide 
bridges are established l)etwe^ a and P chains. NIH reviewers noted that ddoroplasts 
expression of Mg^ levels of propedy assembled proinsulin was mianticipated. Nor was it 
obvious, as pointed out by NIH reviewers* that proinsulin expressed within plastids would be 
5 folly iiinatioiiaL Iticr to this inventioii, there was no report of esq^^ 
plastids. 

Similarly, piior to fliis invention, there was no report of eaqyression of Human Serum 
Albumin within plastids. Human serum alhumin is a globular protein of 66.5 IcDain si2e that 
should be piopedy folded and stalnlized with seventeen d Badi human seium 

10 albumin consists of three struciuratty shnilar lobular domains and die disulfides are poationed 
in repeated series of nine loop4ink-loop structures cmtered around sequential Cys^Cys 
pairs. HSA is iiutiaUy synttiesized as pie^iro-albuinin by the Uver and re 
endoplasmaticreticuhnnaflerrazioyaloftheaminoterm^^ amino acids. The 

pro-albumin is further processed in the Golgi con^loi wtim the otiia: 6 aminoteiminal residues 

15 of the propeptide are cleaved by a serine proteinase* Tins results in the secretion of mature 
potypqpftideof585 amino acids, ft was likewise unantidpated that fWy assembled HS A would 
be synthesized in large amount wittun plastids. 

Vibrio choierae causes diarrhea by co Ionizing the small irrtestine and producing 
20 euterotoxins, of which the cholera toxin (CT^ is considered the main cause of toxicity. CT is a 
hexameric ABs protein having one 27KDa A subunit which has toxic ADP-ribo syl transferase 
activity and a non-toxic pentamer of 1 1 . 6 kDa B subunits diat are noiM^ovalentfy linked into a 
very stable dougjmut Hke structure into which the toxic active (A) subunit is inserted The A 
subunit of CT consists of two fiagments -Al and A2 ^K^iidi ace 1^^ 
25 oizymatic activity of Cr is located 8ol^ on the Al firagm^tIlieA2fiaginesit of the A subunit 
ticks the Al firagmeot and the B pentan^. CT binds via specific interactions of the B-subunit 
pentamerwithGMl gmgitoside, the membrane receptor, present on the intestinal epithelial cefl 
sur&ce of the host Ilie A subunit is then tiaidocated into die ceH where it ^ 
Os subunit of adenylate c^nctase bringing about the increased levds of cyclic AMP in affected 
30 cells that is associated with the electrolyte and fhiid loss of clinical cholera. However^ tiic B 
subunit, when administered ox^Sfy, is apotent mucosal immunpgen which can neutralize die 
toxicity ofCThobtoxin by preventmg its biodiiig to intestinal .cells. To achieve diis effect, die 
B-subunit must be assembled in a pentameric fonn and disulfide bridges should be established 
among the subunit stmctuies. It was not obvious that plastids could ejqsress and assemble 
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peotameiic CTB. Thcrehasbeennopdoriqpoitof e?q)iesdQnof 
plastids. 

Thm was bo certainty that biophazmaceuticals would assemble iifximally in chloioplasts 
S or that tfaeywovddietain their fmictionality in the pnor^ Indeed, these mi^ have been 
unforeseen deleterious .efifeds of high-level ^qpression of biophannaceoticals in dbloropbsts on 
plant growdi or development that were not apgex&cA &am the e:q>eriences with othra: Iransgenes. 
The pH and oxidation state of die chloroplast differs fibm diat of tiie human blood or bacterial 
ceU in ways that mi^ inhibit or prevent proper folding and assembly. 

i 0 Tlieic arc examples of protein complexes in die chloroplast in which all the subonits are 

native to the plants the ribosome being an example. Howev^, the expression and assembly in 
transformed chloroplasts of heterologous proteins into multi-^porotein conq^lexes has not been 
reported undl the present invention. ThercisasingleexanQ^leinthetiteratoreof anmter-chain 
disulfide bond in plant chloroplasts, and ttiat is between neighboring large subonits of the 

15 ^E^meribulose-l, S-biphosphase carboa^ase/oxygenase. The esqxression and assembly in 
transformed ^iloroplasts of fonctkmal proteins coosisting of difiKsrent protein chainf8» fnclnding 
disulfide bcmds between diffoent subonits, as rcfxesented by expression and assembly of a 
mawiifmliaTi proteui has never been demonstialed untQ die present invention. 
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PRODUCTION OF PHARMACEUTICAL PROTEINS 
m TRANSGENIC CHLOROPLASTS 

RELATED APPLICATION 
This patent plication claizos benefit of U.S. ProvisiQnal Application No. 
60/1 85,987, filed Mardi 1,2000. ThiisearUerp]t>visional^pU<»itkmisherd)y 
by reference. 

FIELD OF THE INVENTION 

This inventioii tdates to die production of pharmaceutical proteins in transgenic 
chloroplasts. More parttculaxly, Hm invention relates to "die production of faunm insulin in 
tobacco plants. 

BACKGROUND 

Research efforts have been made to synthesize high value pharmacologically active 
recombinant proteins in plants. Recombinant proteins such as vaccines, monoclonal 
antibodies, honnones, growth &ctors, neuropeptides, cytotoxins, serum proteins and 
enzymes have been expressed in nuclear transgenic plants (May aL, 1996). It has been 
estimated that one tobacco plant should be able to produce more recombinant protein than 
a 300-liter fermenter of E. coU. In addition* a tobacco plant produces a million seeds, 
hereby facilitating large-scale production. Tobacco is also an ideal choice because of its 
relative ease of g^etic manipulation and an impending need to e;^lore alternate uses for 
this hazardous crop. 

A primary reason for the high cost of production via fermentation is the cost of carbon 
source co-substances as well as maintenance of a large fermentation facility. In contrast, 
most estimates of plant production are a thousand-fold less expensive than fermentation. 
Tissue specific expression of high value proteins in leaves can enable the use of crop plants 
as renewable resources. Harvesting the cobs, tubers, seeds or fruits for food and feed and 
leaves for value added products should result in fisrther economy with no additional 
investmrat 
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However, one of the major limitatioiis mprodudngphannaceuti^ 
is thdr low level of fordgn protein expicssion, despite r^rts of hi^ber level repression of 
enzymes and certain proteins. May et al. (1998) discuss tliis problmi using the following 
exan^les. AllhoughplantdecivedieconibinanthepatitisBsiir&ce 
as a commercial recombinant vacdne, tiie levels of expression in transgenic tobacco were 
low (0.01% of total soluble protdn). Even fliough Norwalk virus capsid protein ^xptessed 
in potatoes caused oral immunization when consumed as food (edible vaccine), ^ression 
levels were low (0.3% of total soluble protein). A synthetic gene coding for tbe human 
qridermal growth ^ctor was expressed only up to 0.001% of total sohible protein in 
transonic tobacco. Human serumalbumrn has been e^qaessedonly up to 0.02% of the total 
soluble protein in transgenic plants. 

Ilierefore, it is ioqK)rtam to inorease levels of e 
plants to exploit plant i»roduction of pharmacologically important protdns. An alternate 
^)proachistoexpressforetgnproteinsincUorQplastsofhig^erp^ Foreign genes to 
10,000 copies per cell) have bera incorporated into the tobacco chloroplast genome resulting 
in accumulation of lecombinant proteins xsp to 30% of tfie total cellular protein (McBride et 
al., 1994). 

The aforementioned approaches (except chloroplast transformation) are limited to 
eukaryotic gene expression because prokaiyotic genes are ^qyiessed poorly in die nuclear 
con;>artment However, several pharmacologically important protems (such as insulm, 
human serum albumin, antibodies, en:^mes etc.) are produced currently in K colu Also, 
several bacterial proteins (such as cholera toxin B subunit) are used as oral vaccines against 
diarrheal diseases. Therefore, it is important to develop a plant production system for 
ejqpression of pharmacologically inq)ortant proteins that axe currently produced in 
prokaiyotic systems (such as E. coli) via fermentation. 

CHoroplasts are prokaiyotic compartments inside eukaryotic cdls. Since the 
transOTptional and translational machinery of the chloroplast is similar to E. coli (Brixey et 
al., 1997), it is possible to express prokaiyotic genes at very high levels in plant chloroplasts 
than in the nucleus. In addition, plant cells containup to 50,000 copies of the circularplastid 
genome (Bendich 1987) which may an^Kfy the foreign gene like a ^lasmid in the plant 
cell,*' thereby enabling higher levels of expression . Therefore, chloroplasts are an ideal 
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choice for eaqsiession of recombiimt pro coH (such 

as rnsulin^ famnan senim albumin, vacdnes» antibodies, etc.). We e^loited die chloroplast 
tcansfonnation approach to express apbannacological protean fhat is of no value to Replant 
to dmnonstrate fhis conc^t, GVGVP gene has been syndiesized with a codon prefenred for 
S pxokaryotic (EG121) or eokaiyotic (TGI31) expression. Based on transcript levels, 
chlon^last expression of this polymer was a hundred-fold higher than nuclear eaqxression ' 
in transg^c plants (Guda et al.» 1999). Recendy, we observed 16^6-foId more tps 1 
transects in ddootoplast transfcmnants than the hig^ escpressing nuclear transgenic plants 
(Lee et aL 2000, in review). 

10 

DETAILED DESCRIPTION 

Jn our research^ we use insulin as a model protein to demonstrate its production as a 
value added trait in transgenic tobacco. Most inq)ortantly, a significant advantage in the 
production of pharmaceutical proteins in chloroplasts is their ability to process eukaryotic 

IS protein, including folding and formation of disulfide bridges (Dreshcher et al., 1998). 
Chap^ronin proteins are present in chloroplasts (Verling 1991 ; Roy 1989) that function in 
folding and assembly of prokaryotic/eukaryotic proteins. Also, proteins are activated by 
disulfide bond oxido/reduction cycles using the chloroplast roicredoxin system (ReuUand and 
Miginiac-Maslow, 1999) or chloroplast protein disulfide isom^:ase (Kim and Nfayfield, 

20 1997). Accumulation of fidly assembled, disulfide bonded form of antibody inside 
cUoroplasts, even though plasties wa?enottransformed(Du^ 1990), provides ^rong 
evidence for successfial assembly of proinsulin inside chloroplasts. Indeed, we observed 
fiilly assembled heavy and light chains of humanized Guy's 13 antibody in transgenic 
tobacco chloroplasts (Panchal et al. 2000, in review). Such folding and assembly elindnates 

25 theneedforpost-^urificationprocessingofphaimaceuticalproteiiis. Chloroplasts may also 
be isolated from crude homogenates by centrifugation (1500 X g). This fiaction is free of 
oth^ cellular proteins, delated chloroplasts are burst open by osmotic shock to release 
foreign inroteins that are compartmentalized in this organelle along with few other native 
soluble proteins (Daniel and McFadden, 1987). 

30 GVGVP is a PBP made from synthetic genes. At lower temperatures the polymers 

exist as more extended molecules which, on raising the tenq)e3:atiire above the transition 
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xaDge^liydrophobically fold into dyaamic stnictores caUed fi-sfpirals that fhrdi^ aggregate 
tyy bydrophobic assodatioii to farm twisted filaments (Uny» 1991; Uny, et al., 1994). 
Inverse temperature transition ofifers sev^ advantages. Expense associated witti 
duromatogrq>luc tesins and eqoipmoit aie eliminated. It also fadHtates scale iip of 
5 pnrificatiQn from grams to kilograms. Nfild^ purificatian conditions use only a modest 
change in terq>aatnre and ionic strengdi. This also fadlitates higher reooveiy, &ster 
pinificatian and hig^ Yohnne processing. Protein purificatian is generally fhe slow step 
(botdeneck)inphaimaceofticalproductdeyelopment Ihcough exploitation of Ifaisieversible 
inverse temporatuFe transition property, simple and ine?q)ensive extraction and purification 
10 isp^ormed. The temperature at \)s^ch the aggregation tAcs place can 

engineering bic^Iymers contaimng varying numbers of repeats and changing salt 
concentration in solution (McPherson et al., 1996). Chloroplast mediated expression of 
insulin-^lymer fusion protein elinunates the need for the &q»ensive fennentation process 
as wdl as reagents needed forrecombinantpiotein purification and downstream processing. 

15 

Large-scale prodoction of insulin in plants in conjunction with an oral delivray system 
is a powerful q>proach to provide insulin to diabetes patients at an affordable cost and 
provide tobacco fanners alternate uses for this hazardous crop. For example. Sun et aL 
(1994) showedthatfeedingasxnaUdoseof antigens coiyugated to the receptor biiidingnoxi- 

20 toxic B subunit moi^ of the cholera toxin (CTB) sqjpressed systonic T cell-mediated 
inflammatoiy reactions in animals. Oral administmtion of a myelin antigen conjugated to 
CTB has been shown to protect animals against encq>halomyelitis, even when given after 
disease induction (Sun et al. 1996). Bergm)t et al. (1997) reported that feeding small 
amounts of human insulin conjugated to CTB si^ressed beta cell destruction and clinical 

25 diab^ in adult non-obese diabetic (NOD) mice. The protective effect could be transferred 
by T cells from CIB-insulin treated animals and was associated with reduced insuUtis. 
These results demonstrate diat protection against autoimmune diabetes can indeed be 
achieved by feeding small amounts of pancreas islet cell auto antigen linked to CTB 
(Bergerot, ^ al. 1997). Conjugation with CTB facilitates antigrai delivery and presentation 

30 to the Out Associated Lymphoid Tissues (GALT) due to its affinity for tiie cell surface 
receptor GM-ganglioside located on GALT cells, for increased uptake and immunologic 
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recognition (Arakawa et al. 1998). Tiansgcnic potato tubers expressed up to 0.1% CTB- 
insulin &sion proteca of total soluble protein, which retained GM-ganglioside binding 
a£Bnity and native autogenic^ for both CTB and insuM NOD mice fed with transgenic 
potato tubecs containing nncrogram quantities of CTB-insulin fusion protein showed a 
substantial teduction in insulitis and a dday in the progression of di^>etes (Arkawa et al., 
1998). However, for commercial exploitation, tiie levels of expression need to be ina:eased 
in transgenic plants. Therefore, we imdertook die eTqnression of CTB-insuIin fiision in 
transgenic cUoroplasts of nicotine fiee edible tobacco to increase levels of expression 
adequate for animal testing. 

Jn accordance wiHi one advantageous feature of this invention, we use poIy(CjVGVP) 
as a fusion protein to enable hyper-«^iression of insulin and acconq)lish rapid one step 
purifusition of fusion pq>tides utiliztng the inverse tenq>erature transition properties of this 
polym^. In anofhra: advantageous feature of this invoition, we develop insulin-CTB fusion 
protein for oral delivery in nicotine finee edible tobacco (LAMD 60S). Bodi features are 
accon^)lished as follows: 

a) Develop recombinant DNA vectors for enhanced e^qiression of Proinsulin as fusion 
proteins with GVGVP or CTB via chloroplast genomes of tobacco, 

b) Obtain transgenic tobacco (Petit Havatia& LAMD 605) plants, 

c) Characterize transgenic egression of proinsulin polymer or CIB fiisian protrins 
using molecular and biodiemical methods in chloroplasts, 

d) Eooploy existing orinodifiedniethodsofpolymerpurificationfiromtiansgmic leaves, 

e) Analyze Mendelian or mat^nal inheritance of transgenic plants^ 

f) Large scale purification of insulin and conqiarison of cmrent insulin purification 
mefhiods with polymer-based purification method in E, coli and tobacco^ 

g) Compare natural refolding chlorpplasts with in vitro processing, 

h) Characterizatton (yield and purity) of proinsuHn produced in E, coli and transgenic 
tobacco, and 

i) Assessment of diabetic synqitoms in nuce fed with edible tobacco expressing CTB- 
insulin fusion proteiiL 

Diabetes and Insulin: Insulin lowers blood ghicose(OaldyetaL 1973). This is a result of 
its immediate effect in increasing glucose uptake in tissues. In muscle, under &e action of 
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insalin, ^ncose is more readily taken up and d&er converted to g3ycogeQ and lactic acid or 
oxidized to caibon dioxide. Insulin also affects a inimber of ing>ortante3^^ 
with cellular metabolism. It increases flie activity of ghicokinase, which pho^horyiates 
glucose, tfiereby increasing the rate of glucose metabolism in tiie liver. BasuHn also 
5 siq^nresses ghiconeogenesis by depressing the function of liver enzymes, which operate the 
reverse pa&way iGrom proteins to ghicose. Lack of insulin can restrict the transport of 
ghcose into muscle and adipose tissue. This results in increases in blood ghicose levels 
Oiypergjiycemia). In addition, tiiejhreakdownofnatural fat to free i&tty adds and g^ 
is increased and there is a rise in the &tty acid contmt in the blood. Increased catabolism of 

10 fitty adds by the Uver results in greater prodocdcm of tetone bodies. Th^diffiosefiomthe. 
liver and pass to the muscles for further oxidation. Soon, ketone bo^ production rate 
exceeds oxidationrateandketosis results. Feweraminoaddsaietakmiq>byftetissuesand 
protein degradation results. At the same time, gluconeogenesis is stimulated and protein is 
used to produce glucose. Obviously, lack ofinsulin has serious consequences. 

IS Diab^es is dassified into types I and n. Type I is also known as insulin dependent 

diabetesmellitus([DDM). UsuaUytfaisiscausedbyaceU-niediatedautoininumedestrttction 
of the pancreatic p-cells (Davidson, 1998). Those suffering from this type are dependent on 
external sources of insulin. I^P^ ^ known as noninsulin-dependent diabetes mellitus 
(NIDDM). This usually involved resistance to insulin in combinaticm with its 

20 underproduction. Tbese prominent diseases have led to extei^vereseardi into microbial 
production of recombinant famnan insulin (rHT). 

Expression of Recombinant Human Insulin in coUi In 1978, two diousand tdlograms 
of insulin weane used in the woild each yean half of this was used in die United States 
(Steiner et al., 1978). At that time, the number of diabetics in the US were increasing 6% 

25 evQyyear(Gunby, 1978). In 1997 - 98, 10% increase in sales ofdiabetes care products and 
19% increase in insulin products have been r^rted by Novo Nordisk (world's leading 
supplier of insulin), making it a 7.8 billion dollar industry. Annually, 160,000 Americans 
are killed by diabetes, making it the fourfli leading cause of death. Many meftods of 
production of rHI have been developed. Insulin genes were first chemically synthesized for 

30 expression m Esherichia coli (Crea ct aL, 1978). These genes encoded sq)arate insulin A 
and B diaios. The genes vf&ce each expressed m £1 coli as fusion proteins with the P- 
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galactosidase (Goeddel et al., 1979). The fiist documented producdon of iHI using this 
systeoi was reirartedby David Goedddfix)mGeaentech(^^ 1988). Forieasons eT^lained 
later, fhe genes were fused to the Tip synthase gene. This fusion protein was q)proYed for 
cQmmercial producdon by EU Lilly in 1982 (Chance and Frank, 1993) with a product name 
ofHumulin. Asofl986,Humulinwa5producedfiomproinsulin genes. Proinsulin contains 
both insulin chains and the Opeptide ttiat connects them. Data concerning commeicial 
production ofHunxuIin and otibesr insofin products is now considered proprietary in&miation 
and is not available to die public. 

Delivery of Human Insulin: Insulin has been deUveredintravenoiisdy in the past severd 
years. However, inorexecendy, alternate ine&ods such as nasal spray are also a^ 
Oral deUvexy of insulin is yet anotiier new q3proach(Ma^ 1997). Engineered 

pofymer microspheres made of biologically eiodable polymers, which display strong 
interactions wifii gastromtestmal mucus and cellular lixiings, can traverse both mucosal 
absorptive epithelimn and the foUicle-assodated epithelium, covering the lymphoid tissue 
of Peyer's patches. Polymers maintain contact with mtestnial epithelium for extended 
periods of time and actually penelrate through and between cells. Animals fed with titie 
poly(FA: PLGA)-encapsulated insulin preparation were able to regulate the glucose load 
b^ter than controls, confirming that insulin crossed the intestioal barrier and was released 
from the microspheres in a biologically active form (Mathiowitz et al., 1997). 
Protein Based Poljmers (PBP): The syndietic gene that codes for a bioelastic PEP was 
designed after repeated amino add sequences GVGVP, observed in all sequenced 
mammalian dastin proteins (Ycfa et al. 1987). Elastin is one ofthe strongest known natural 
fibers and is present in skin, ligaments, and arterial walls. Bioelastic PBPs containing 
multiple repeats of ftis pentamer have remaricable elastic properties, enabling several 
medical andnon-medical applications (Uiiy et al. 1993, Uiry 1995, Daniell 1 995). GVGVP 
polymers prevent adhesions following surgery, aid in reconstructing tissues and delivering 
drugs to the body over an extended pmod of time. North Am^can Sdence Associates, Inc. 
reported that GVGVP polymer is non-toxic in mice, non-sensitizing and non-antigenic in 
guineapigs, andnon-pyrogeoic in rabbits (Urry et al. 1993). Researchers have also observed 
that inserting sheets of GVGrVP at the sites of contaminated wounds in rats reduces the 
number ofadhesionsdiat form as the wounds heal (Urry etaL 1993). In a similar manner. 
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using the GVGVP to encase muscles that are cut during eye suig^ in rabbits prevents 
scarring following tiie operation (Uny et al. 1993, Uny 1995). Other medical applications 
of Uoelastic PBPs include tissue recGnstmction (syn&etic ligaments and arteries^ bones)» 
wound coverings, artificial pericardia, catheters andprogrammeddrug delivery (Ifrry, 199S; 
5 UnyetaL, 1993, 1996). 

We have expressed tiie elastic PBP(GyGVP)t2i in E. cofi(Gudaetal. 199S,Brix^ 
dtaL 1997)>inthefungasilspergi/&^ni^^ 1997), in cultured tobacco cells 

(Zhang et al. 199S), and in transgenic tobacco plants (Zhang et aL 1996). In particular, 
(GVGW)i2i hasbeen e^qpressed to such high levels in£l co/ithatpolym^ inclusion bodies 

10 oociqnedi9toabout90%ofthecell vdume. Also, inclusion bodies have been obsoved in 
chloroplasts of transgenic tobacco plants (see attached article Daniell and Ouda, 1997). 
Recently, we rq>Qrted stable transformation of die tobacco chlcuroplasts by integration and 
expression ttie biopolymer goie (EG121), into the Large Single Copy region (5,000 copies 
per cell) or the Inverted Repeat region (10,000 copies per cdl) of the chlorpplast genome 

15 (Gudaetal.,1999). 

PBPasFnrionProtdns: Severalsystemsarenowavailabletosinq>lifyproteinpurifi^ 
including the maltose binding protein (Marina et al. 1988), glutetfiione S-tranferase (Smith 
and Johnson 1988), biotin3dated (Tsao et al. 1996), thioredoxm (Smith et al. 1998) and 
cellulose binding (Qng et al. 1989) protdns. Recombinant DNA vectors for fusion with 

20 short p^tides are now available to effectively utilize aforementioned fusion proteins in the 
pmification process (Smi&etai. 1998; Kim and Raines, 1993;Sue^aL 1992). Recombinant 
proteins are generally purified by affinity dhromatogr^hy, using ligands ^)ecific to carrier 
proteins (Nilsson et aL 1997). While these are useful techniques for laboratory scale 
purification, aGBnity chromatography for lai^scale purification is time consundng and cost 

25 . prohibitive. Therefior^ economical and non-chromatograpbic techniques are highly 
desirable. In addition, a common solution to N-^terminal degradation of small peptides is to 
fuse foreign peptides to endogenous R coli proteins. Early in ttie development of this 
technique, ^-galactosidase (P-gal) was used as a fusion protein (Goldberg and GofT, 1986). 
A drawback of this method was that the P-gal protein is of relatively high molecular weight 

30 (MW 1 00,000). Theiefore, tibe proportion of the peptide product in the total protein is low. 
Another probl^ associated with the large P-gal fusion is early termination of translation 
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(Biimette, 1983;Hall» 1988). This occurred i;i4ien P-gd was used to prodaceluiaian insulin 
peptides because the fusion was detached fiom the ribosome during translation tiius yielding 
inco]npletepq)tides. . OiOierpiDtdns of lower molecular wdgjit proteins ^ve been us^ 
fusion proteins to increase die pepdde production. For exanq)le, better yields were obtamed . 
5 with tize tryptophan syndiase (190aa) fusion proteins (Hall» 1 988; Bumett,1983). 

Accordingly^ one acMevemeait according to ttiis inventicm is to usepoly(GVGVP) as 
a fusion pxotdn to oiable hypor-eaptesston of insulin and accomplish mpid one step 
purification of tiie fusion peptide* At lower tenqieratuies the potyme^ exist as more 
extended molecules ^i4iich» on raising die traq>earature above the tran^on iange» 

10 hydnyhobically fold into dynamic structures called p-spirals that fiirtfacr aggreg^ by 
hydn9hobicassociationtofonntwistedfi]aments(Uzry,1991). Througihe3q>loitationofthis 
reversible property, sm^Ie and inesqpaisive extraction and purification is performed. The 
teoqperature at whidi aggregation takes place (T]) is manipulatedby engineeringbiopolymm 
containing varying numbers of repeals or changing salt concentration QMcPherson et al., 

15 1996). Another gn>i9 has recently demimstrated purification of recombinant p^ 

fusion witii diermally responsive polypeptides (Meyer and ChiDcoti/ 1999). Polymers of 
diff^mt sizes have been synthesizedandexpressedini^. coU. This approach also eliminates ' 
the need for expulsive reagents, equipmrat and time required for purification. 
Cholera Toxin p snbnnit as a fusion protein: Vibrio cholerae causes diarrhea by 

20 colonizing the small intestine and producing enterotoxins, of which the cholera toxin (CT) 
is considered the main cause of toxicity. CT is a hexamedc AB5 protein having one 27KDa 
A subunit which has toxic ADP-ribosyl transferase activity and a nonrtoxic pentamer of 1 1 . 6 
kDa B subunits that are non'<M>valmtiy linked into a very stable douglhnut like structure into 
which the toxic active (A) subunit is ins^ted The A subunit of CT consists of two 

25 fiagments -Al and A2 which are hnked by a disulfide bond. The en^moiatic activity of CT 
islocatedsolelyontheAlfi:agmeat(Gill, 1976). The A2 fragment oftiie A subunit links the 
Al fiagmrat and the B pentamer. CT binds via specific interactions of the B subunit 
pentamo: witii GMl ganglioside, the membrane receptor, present on die intestinal q)itiielial 
cell surface of the host The A subunit is then translocated into tiie cell where it ADP- 

30 ribos^ates the Gs submut of adenylate cyclase bringing about the increased levels of cyclic 
AMP in affected cells that is associated with the electrolyte and fluid loss of clinical cholera 
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(Lebens et al. 1994). For optimal eDzymalic activity, the Al fiagmmtneeds tobe sqiaxated 
tcom the A2 firagment by proteolytic cleave of the main chain and by reduction of the 
disulfide bond linking them ^dsalanos et al.» 1979). 

The Bsqxressioii and asseinbly of CTB in transgenic potato tubers has been iqKjrrted 
5 (Arakawa et al. 1997). The CTB gene including the leader peptide was fused to an 
endoplasmic i^culum Ideation signal (SEKDEL) atfhe 3' end to sequesterlhe CTB protein 
wiAintfaehunoioflfaeQl. The DNAfiagtnentmcodSng the 2l-aniin6 add leader peptide 
of the CTB protebi was retained to direct tfie newly synthesized CTB protein into the lumen 
of the ER. Imnnmoblot analysis indicated that the plant derived CFB protein was 
10 antigenically indistinguishable ficnn the bactedal CTB protein and Oat oUgomeric CTB . 
molecules (Mr SO kDa) were ttie domioant molecular sfpedes isolated fiom transgenic 
potato leaf and tuber tissues. Similar to bactmal CTB, plant derived CTB dissociated into 
monomers (Mr- IS ]d)a) during heat acid treahnent 

IS Enzyme Unkediiimiunosorfoent assay methods iiidic^^ 

protein bound specifically to GMl gpngliosides, the natural nieinbranerecq)tar5 of Cholera 
Toxin. The maximum anK)uiit of CTB protein detected ia auxin induced transgraic potato 
leaf and tub^ issues was approximately 0.3% of the total soluble protein. The oral 
imnumization of CI>-1 mice with transgenic potato tissues transformed with the CTB gene 

20 (administered at weeldy intervals for a month with a final booster feeding on day 6S) has 
also been reported. Thelevelsofs^imi and mucosal anti-cholera toxin antibodies in mice 
were found to generate protective inmmnity against the cytopatfaic effects of CT holotoxin. 

Followmg intraileal injection with CT> the plant immunized mice showed iq) to a 60% 
2S reduction in diardieal fluid accumulation in the small intestine. Systemic andnmcosal CTB- 
specific antibody titers were determined in both serum and feces collected from immunized 
mice by the class-specific chemihimiTiescent RTJSA metiiod and die en^)oint titeis for the 
three antibody isotypes (IgM, IgG and IgA) were determined. 

The extent of CT neutralization in both Vero cell and ileal loop experiments suggested 
30 timtanti-CIB antibodies preventer binding to ceUularQMl-gangU Also^ mice fed 

with 3 g of transgenic potato exhibited similar intestinal protection as mice gavaged with 3 0 



WO01/72»59 nPCT/USOl/06288 

14 



. g of bactoial CTB. Recombinaiit LTB [rLTB] (the heat labile eotecotoxin produced by 
^iterotoxigpnic E, coK) wbich is structurally^ functionally and inunrmologically similar to 
CTBwaseKpressedintraiisgemctol>acco(AxntzenetaLl^ Th^bave 
rq[X)rted 4iat tfae rLTB r^ained its antig^city as shown by iinmixn0precq>itation of rLTB 
5 wtdi antibodies raised to rLTB £tom £1 co/^\ The rLTB protdn was of tfae rigjit molecular 
wdg^t and aggregated to form the pentam^ as confirmed by gel pCTieation 
chromatography. 

CTB has also been demonstrated to be an efifective carriermolecule for induction of 
mucosal immunity to po]^pq)tides to ^ch it is chemically or gmetically conjugated 

10 (McKenzie el al, 1984; Dertd)augh et al, 1993). The production of immunomodulatoiy 
transnmcosal carrier molecules, such as CTB, in plants may greatly improve ttie efScacy of 
edible plant vacdnes(Haqetal, 199S; Thanavala et al, 1995; Mason etal, 1996) and may 
alsoprovidenovel oraltol^ranceagentsfor prev^on of such autoiimnune diseases as Ty^ 
1 diabetes (Zhang et al, 1991), Rhomiatoid ardiritis (Trentham et al, 1993), multiple 

IS sclerosis (Khoury et al, 1990; Miller et al, 1992; Weiner et al, 1993) as wdl as the 
prevention of allergic and allograft rejection reactions (Saveg^ et al, 1992; Hancock et al, 
1993). Therefore, e»]>ressing a CIB-proinsulin fiision is an ideal approadi for oral delivery 
of insulin. 

ChloropIastGeneticEngineertD^: Several eavtronmeatalproblemsrelatedtoplantgen^c 
20 engineering now prohibit advancement of this technology and prevent realizatian of its full 
potential. One such common concmi is the demonstrated ^c^ of foreign genes througji 
pollen disqpersalfixnntransgraic crqpplants to their weedy relatives creating si^^erweeds or 
causmg gene pollution among olher crops or toxicity of transgenic pollen to non-taig^ 
insects such as bntterfHes. The Ug^ rates of gene flow fix>mcarops to wild relatives (as hi^ 
25 as 38% in sunflower and 50% in strawbaries) are certainly a serious concern. Qearly, 
maternal mheritance (lack of chloroplast DNA in pollen) of tfae herbicide resistance gene via 
chloTopIast genetic engineering has been shown to be a practical solution to these problems 
(Danidl et al, 1998). Another common concern is the sub-optimal production ot Bacillus 
thuringiensis (B.t) insecticidal protein or reliance on a single (or similar) B.t. protein in 
30 commercial transgenic crops resulting in B.t resistance among target pests. Qearly, 
different insectiddal proteins ^ould be produced in letibial quantities to decrease the 
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developmmt of resistance. Siu^hhyper-expressicmofanoyelB.tprotem 
resulted in 100% mortality of insects that axe up to 40,000- fold resistant to ofh^ B.t 
proteins (Kota et al. 1999). Therefore, cUcnxyplast genome is an attractive target for 
expression of foreign genes due to its ability to express extraordinarily high levels of foreign 
5 proteins and eEScient containment of foreign genes through matetnal irihoitance. 

When we developed ihc concept of ddoroplast genetic engineering (Daniell and 
McFadden, 1988 U.S. Patmts; Danidl, World Patent, 1999). It was possible to introduce 
isolatedintact(Moroplasts into protoplasts andxegeneratetransg^ 1973). 
Therefore^ early investigations on cUoropIast transformation focused on the developmmt 

10 of in orgmello systrans using intact chloroplasts enable of efiicirat and prolonged 
transcription and translation (DanieD and Reibeiz, 1982; Daniell et al., 1983, 1986) and 
e?q»ression of foreign genes in isolated chloroplasts (Daniell and McPadden, 1987). 
However, after the discovery of tibe gene gun as a transformation device (Daniell, 1993), it 
was possible to transform plant chloroplasts without the use of isolated plastids and 

IS protoplasts. Chloroplast genetic eiigineering was accomplished in several phases^ Transient 
expression of foreign genes in plasdds of dicots (Daniell et aL, 1990; Ye et al., 1990) was 
followed by such studies in monocots (Daniell et al., 1991). Unique to ihe chloroplast 
genetic engineering is the . development of a foreign gene expression system using 
autononunisly rq)licating diloroplast expression vectors (Daniell et al., 1990). Stable 

20 integration of a selectable marker gene into the tobacco diloroplast genome (Svab and 
Malign, 1993) was also accomplished usingthe gene gun. However, useful genes conferring 
valuable traits viachloroplastg^etic engineeringhavebeen demonstrated onlyrecaa^. For . 
exan^le, plants resistant to B.t sensitive insects were obtained by integrating the crylAc 
gene into the tobacco chloroplast genome (McBride et al., 1995). Plants resistant to B.t 

25 resistant insects {up to 40,000 fold) were obtained by hyper-e?q>ression of the cryilA gene 
widiin die tobacco chloroplast genome (Kota et al., 1 999). Plants have also been genetically 
engmeered via the chloroplast genome to confer herbicide resistance and the introduced 
foreign genes were maternally inherited, overcoming the problem of cut-cross with weeds 
(Daniell et al., 1998). Chloroplast genetic engineering has also been used to produce 

30 phannaceuttcal products that are not used byplants(Gudaetal., 2000). Chlorpplast genetic 
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m^eering technology is cuneady betng E^Ued to other useful crops (Sidotov et al. 1999; 
DameU. 1999). 

Polymer-proinsiilin Recombuiant DNA Vectors: Fkst we developed independent, 
dhloroplast vectors for die expression of insoltn chains A and B as polymer fi:^Dn pqitides, 
S as it has been produced in £ coll for conmercialpm^ The disadvantage of 

diis meOiod is that E. coli does not form disulfide bridges in the cell unless the protein is 
taigetcd to the periplasm. E3q)en5ivemvK^assemblyafierpuiificationi5neoessaryfi)rthis 
aiyproach. llierefore, a better approach is to express die human pioinsulin as a polymer 
fu^onproteht Thismediodisbetterbecausechloroplastsareci9>ableofforn^ 

10 bridges. Using a single gei^ as qyposed to die individual chairis^eliixi^ 

of conducttng two parallel vector construction processes, as is needed for individual chains. 
In addition, the need for individual fermentations and purification procedures is eliminated 
by; the single gene method Further, proinsulin products require less processing foUowin 
esctcaction. Anodic benefit of using the proinsulin is diat the C-peptide, vAnxh is an 

IS essential part the proinsulin protein, has recendy been ^own to play a positive role in 
diabetic patimts (Ido et al, 1997). 

Recendy, die human pr&5m>insulin gene was obtained fiom Genentech, Inc. First, 
diepre-proiiisulinwassub-</lonedintopUC19tofaciUtatefui^ Tlienext 
step was to destgn primers to make chloroplast e^iression vectors. Since we are interested 

20 in proinsulin ecq>ression, die 5* primer was designed to land on fheproinsu^ This 
FW primer eluded the 69 bases or 23 coded amino adds of the leader or pre-sequence of 
prq)roinsu]in. Also,die£Qrwaniprimerindiideddieenzyinaticcleavagesi^ 
factor Xa to avoid the use of cyanogen bromide. Beside the Xa-factor, a Smal site was 
introduced to &cititate subsequent subcloning. The ord^ of the FWprimer sequence is Smal 

25 - Xa-factor - Proinsulin gene. The reverse primer includes BamHl and Xbal sites, plus a 
short sequence widi homolgy with the pUC19 sequence foUowing die proinsulin gene. The 
297bp PCR product (Xa Pris) includes three restriction sites, which are the Smal site at the 
5'-aid and Xbal/BamHl sites at die 3' aid of the proinsulm gene. The Xa-Pris was cloned 
into pCR2.1 resulting in pCR2.1 - Xa- Pris (4.21d)). Insertion of Xa-Pris into die multiple 

30 cloning site of pCR2. 1, resulted in additional flanking restriction esayvos^ sites that will be 
used in subsequent sub-cloning stq>s. A GVGVP SO-mer was g^erated as described 
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pieviously (Daniell et al. 1997). The ribosome binding sequence was intcodiiced by 
digesting pUCs-10, which contains tiie RBS sequence GAAGGAG, witti Nool and Hind III 
flanking sites. The plasmid pUC19-S0 was also digested vnUi fhe same cazymcs. The 
SChn^genewasehitedfiomtfaegelandUgatedtopUC^lOtDprodu^ The 
ligation step inserted into die SQmer gene a RBS sequence and a Smal site outside the gene 
to £u:ilitate subsequoit fusion to proinsulin. 

Anoflier Smal partial digestion was perfomifid to eliminate tibe stop codon of the 
Copolymer, transform the SQmer to a 4QEner, and fuse the 4Qmer to tiie Xarproinsulin 
sequence. The conditicms for this partial digestion needed a deorease in DNAcon^ 
and tiie 1:15 dilution of Smal. Once the correct firagment was obtained by tiie partial 
d^estbn of Smal (eliminating the stop codon but include the RBS site)^ it was ligated to tfie 
IQhpromsulin fusion giene resulting in the construct pCR2»l"4(X-XaFris. Finally, the 
biopolymer (4Qmar) - ptoinsulin fiision g^e was subcloned into pSBL-CtV2 (chloioplast 
vector)bydigestiiigbotiiyectorswithXbaL Thenthe fusion gme was ligated to.thepSBLr 
QV2 and the final ye(^r was called pSBLrOOXaPris. Theorientationof the insert was 
chedced with Nool: one the five colonies chosen had the correct orientation of the gene. 
The fusion gme was also subcloned into pLD-CtV vector and tiie orientation was checked 
with EooRl and Pvuil. One of die four colonies bad die correct orientation of the insert 
This vector was called pIJO<)C-XaPris (TFig 

Both chloroplast vectors contain the 168 jdRNApromotar(Ptm)drivingtheselectable 
maik^ gene oadA (aminoglycoside adenyl transferase conferring resistance to 
spectinomycin) followed by the psbA 3' region (the terminator fiom a gene coding for 
photosystem n reaction center components) fiom Ihe tobacco chloroplast genome. The only 
difference between these two chloroplast vectors (pSBL and pLD) is the origb of DNA 
firagments. BotfapSBL andpLD are universal chloroplast e7q>ressibn/integration vectors and 
can be used to transform chloroplast genomes of several other plant species (Daniell et al. 
1998) because these flanking sequences are highly conserved among hi^ier plants. The 
universal vector uses tmA and tml genes (chloroplast transferRNAs coding for Alanine and 
Isoleucine) from the inverted repeat region of the tobacco chloroplast genome as flanking 
sequences for homologous recombination as shown in Figs. 2A and 3B. Because the 
universal vector integrates foreign genes within the InvertedRepeat region of the chloroplast 
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^nome, it should double tbe cofyy number of insulin genes (from 5000 to 10,000 copies per 
cell in tobacco). Furfihennore, it has been dononstzated that homoplasm 
in the first round of selection in tobacco probably because of ihc presence of a chloioplast 
origm of replication within the flanking sequence in &e univeisal vector (thereby providing 
S more tenqplatesf(»r integration). Becaiseofdiese and several other reascnis, foreign gene 
e^qiiession was shown to be much higher wbeaa the universal vector was used instead of ttie 
tobacco specific vector (Guda et aL, 2000). 

DNA sequence of the polymer-proinsulin fusion was d^ermined to confirm ttie 
ccxiect orientation of genes, in fiame fiision and lack of stop codons in the recombinant 

10 DNA. constructs. DNA seiquendng was performed using a Perfcin Elmer ABl prism 373 
DNA S6qu»dng system using a ABIItism Dye Temii^ The 
kit uses AmpliTaq'DNA jMilymerase. Insertion sites at both ends were sequenced using 
primers for each strand Eqxression of all chloroplast vectors was first tested in E. cott 
bef(n:e tfaeir use in tobacco transfinrmation because of the similaiity of protein synthetic 

15 machinery (BriseyetaL 1997). Ygi Escherichia coliGq^ttesAm 

co/j was transfiormed by standard CaCl^ transformation procedures. 
Expression and Purification of the Biopolymer-proinsulin fusion protein: Terrific broth 
growth medium was inoculated with 40//1 of Ampicillin (lOOmg/tol) and 40/zl of the XL-1 
Blue MRF To strain of £1 coli contaming pSBL^C-XaPris plasmid. Similar inoculations 

20 were made for pLD-OOXaPris and the negative controls, which included both plasmids 
containing the gme in the reverse orientation and the E. coli strain without any plasmid. 
Then,24hrcultureswe]:ecentrifiigedatl3,000ipmfor3min. The pellets woe resuspended 
in 500/^1 of autoclaveddH20 and transferred to 6ml Falcon tubes. The lesuspended pellet 
was sonicated, using a High Intensity Ultrasonic processor, for IS sec at an amplitude of 40 

25 and then IS sec on ice to extract the fusion protein fiom cells. This sonication cycle was 
rq[>cated IS times. The sonicated sanq)Ies were transferred to microcentrifuge tubes and 
c«itriiugedat4**Cat 10,00{)gfor lOmmtopurifythefiisi After coitrifugation, 

the supernatant were transferred to miorocentrifudge tubes and an equal volume of 2XTN . 
bufiFa:(100mMTrisHC3,pH8,100mMNaa)wasadded. Tabes were wanned at 42^C for 

30 25 niin to induce biopolymer aggregation. Then the fusion protein was recovered by 
centrifugingat2^00ipmat42XfQr3min. The recovered fusion protein was resuspetuled 
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in lOOpl of cold ^ter. The purification process was repeated twice. Also, Hie fusion 
protein was recovered by using 6M Guanidtne hydrochloride phosphate buffer, pH 7.0 
(instead ofwater), to facilitate stability of insulin. New cultures were incubated for this step 
following the same procedure as described above, except fliat the pSBL-OC-XaPris 
eaqxressingceUs were incubated for 24, 48 and 72 hrs. Cultures were centrifuged at 4,000 
rpm for 12 min and flie pell^ was resuspended m 6M Guanidine hydrodiloride phosphate 
buffer, pH7Aai3i<iti^ sonicated as described above. After sonication,saniples were run 
in a 16.S%Tricine gel, transferred to flie nitrocellulose menihrane, andtnwmmoblottmgwas 
performed &e following day. 

A 15% glycine gel was run for 6h at recommended voltage as shown in Fig. 1. Two 
diffoent methods of extraction were used. It was observed that when tiie sonic extract is in 
6M Guanicine Hydrochloride Phosphate Buffer, pH7.0, ibe molecular wdg^ changes from 
its original and correct MW 24 Id) to a higjis: MW of q>proximately 3 0 U>a (Pig. 1 C. I) . 
This is probably due to the conformation that the biopolymer takes under tins kind ofbuffer, 
wfaidi is used to maximize the extraction of proinsulin. 

The gel was first stained with 03M CUCI2 and then the same gel was stained with 
Ckm]massieR-2S0 Staining Solutionforan hour and &endestainedfor IS minfirst, and then 
ovemig^ CaCl2 creates a negative stain (Lee et al. 1987). Polymer proteins (wi&out 
fusion) appear as clear bands against a blue background in color or daric against a light 
semiopaque background (Pig. 1 A). This stain was used because other protein stains such as 
Coomassie Blue R2S0 does not stain the polymer protein due to die lack of aromatic side 
chains (McPharson et aL, 1992). Therefore;, the observation of the 24 kDa protein in R2S0 
stained gel (Fig. IB) is due to the insuUn fusion with die polymer. This observation was . 
further confirmed by probing these blots with die antihuman proinsulin antibody. As 
anticipated, the polymer insulin fusion protein was observed in western blots as shown in 
Fig. IC, even dioug^ the binding of antibo^was less effideait^nx)bably due to concealm 
of insulin epitopes by die polymer). Largq: proteins observed as shown ia Fig. IC II are 
tetram^ and hexamer complexes of proinsulin. 

It is evident that the insulin-polyer fusion proteins are stable in E. colL Confinning 
this observation, recently anodier lab has shown diat the PBP polymer protein conjugates 
(with thioredoxin and tendamistat) undergo diennally reversible phase transition, retaining 
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tbe tzansitiQn bdiavior of the free polymer (Meyer and Chikott, 1999). These results cleariy 
(kmonstiate lliat insulin fusion has 

of the polymer. One oTiho concecos is the stability of insulin at tenqperatuics used for 
^ennally reversible purification. Temperatuie induced production ofhmnan insulin has 
5 been in obmmerdal use (Schmidt et al. 1999). Also, the temperature transition can be 
lowered by increasing the ionic strmgdi of die solution during purification of tibds PSP 
(McPherson et al, 1996). Thus, GVGVP-fusion could be used to purify a imiltLtnde of 
ecQOomically inqK>rtantprotrins in a sinq^le inexpensive step. 

Biopotymer-iproiiisQiin foAon gene expression in cUoroplast: As described in section d, 
10 pSBLpOC-R40XaPris vector and pL]>-OC-R40XaPri5 vectors bombarded into the 
tobacco chloroplasts genome via particle bombardmCTt (Daniell., 1997). PGR. was 
pez&imedtocanfinnbiopolymer-prom 

ThePCRproductswaneexBminedin0.8%agaro8egels* Fig. ZAshows primers landrog sites 
and expected PGR products. Fig.2Bshowsthel.6kbpPC3lproduct,confinxiingintegration 

IS of the aadA gene into the chloroplast genome. This 1.6Id> product is seen in all clones 
except L9, which is a nnitant We used primers 2P and 2M to confirm integration of both 
the aadA and biopolymer-prorosulin fusion gene. The 1.3 kbp product corresponds to the 
native chloroplast fiagment and the 3.5 kbp product corresponds to the chloroplast genome 
that has integrated all three ganes as shown in Figs. 2C amdD. All the clones examined at 

20 diis time show heteroplasmy, exce[t C3>mes :8d om Fog/ 2Q and S41b in Fig. 2D, which 
^w almost homoplasmy. 

Protease Xa Digestion of the Biopolymer-proinsiilin fusion protein and Purification of 
Proinsulin: Factor Xa was purchased from New England Biolabs at a concentration of 1 .0 
mg/ml. The Factor Xa is siq^pKed in 20mM HEPES, SOOmM, NaCl, 2mM CaClj, 50% 

25 glycerol, (pH 8.0). The reaction was carried out in a 1 : 1 ratio of fusion protein to reaction 
buffer. Thereactionbufferwasniadewith20mMTri&-Ha,100mMNaC^ 
8.0). The enzymatic cleavage of die fusion protein to release the proinsulin protein fiom die 
(GVVP)4o was initiated by adding the protease to the purified fusion protein at a ratio (ww) 
of proximately 1,500. This digestion was continued for 5 days with mild stirring at 4°C. 

30 Cleavage of the fusion protein was monitored by SDS-PAGE analysis. After the cleavage, 
thesameconditionsaieusedforpurificationoftheproinsulmprotei^ Thepurificationstq>s 
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ate die same as for the piuxfication of fhe fusion protein, except ffiat instead of recovering 
the pellet, tibe supernatant is saved. We detected cleaved proinsulin in tiie extracts isolated 
in 6Mguamd]nehydrocU(md6 buffer as show in F^^^ IC 11. Ckniditionscanbeestimized 
for complete cleavage. The Xa protease has been successfully used to deave(C^ 
5 GST fusion (McPherson et aL 1992). Therefore, cleavage of proinsulin fiom GVGVP usmg 
the Xa protease does notpose problems. 

Vector for CTB expression in chloroplasts: The leader sequmoe (jB3 bp) of thenattve CTB 
. gene (VTl hp) tmr deleted and a gtert cndoti (ATG) intmdnced at 5* end of the T emaintng 

CTB gene (309 bp). Pritoersweis designed to introduce a ibs site 5 bases i^stream 

10 startcodon. The S* primer (SSmer) was designed to and on fhe start codon and &e 5'^ 

IheCTBgene. This primer had an Xbal site at the 5*-ffld, the rbs site [GGAGG], a S bp 
breadungspacefottowedbydiefbst20bpof tfaeCTBgene. The 3* primer (32m^) was 
designed to land on the 3* end of the CTB gene and it introduced lestriction sites at die 3 ' end 
to fedlitate subclonmg. The 347 bp rCTB PGR product was subcloned into pCR2.1 

15 rBSultmgmpc(:3EC2.1-TCrB. Thefindstep wasmsertionofrCTBintolheXbal siteoffhe . 
niuver5alortobaccovectQr(pIJB-QAr2)1hataUows the express coli 
and chloroplasts. Restriction en^me digestion of the pLD-LH-iCTB vector with BamHl 
was performed to confirm the correct orientation of the inserted fiagment in the vector. * 
Because of the similarity of protein synthetic machinery, expression of the diloroplast 

20 vector was tested in £ co/i before its use in tobacco transformation. Voi Escherichia coli 
^qnession the XL^l Blue MRF^o strain was used. E. coli was transformed by standard 
CaClj transformation procedures. Transformed E. coli (24 hrs culture and 48 hrs culture in 
100ml TB with lOOmg/ml an^icillin) and untransformedf. coli (24 hrs culture and 48 hrs 
culture in 100ml TB with 12.Smg/ml tetracycline) was then centrifiiged at 10000 x g in a 

25 Beckman GS-15R centrifuge for 15 min. The pellet was washed witii 200mM Tris-Cl twice 
and resuspmded in 500/zl extraction buffer (200mMTris-a,pH8.^^ lOOmMNaCl; lOmM 
EDTA, 2mM PMSF) and then sonicated using the Autotune Series High Intensity Ultrasonic 
Processor. Then, 100;il aliquots of tiie sonicated transformed and untransfotmed cells 
[containing 50 - 100//g of crude protein extract as determined by Bradford protein assay 

30 (Bio-Rad Inc)] and purified CTB (Sigma O9903) were boiled with 2X SDS sample buffo 
andseparatedona 15%SDS-PAGEgelmTri&^ycinebuffer(25mMTris,250mMg^^ 
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pH8.3,0.1%SDS). Thesq[>aialedpn)teinwa5fheiiti:ansfei^ 
by electro blotdng using &e Tians-Blot Electiophoietic Transfer Cell (Bio-Rad Lie). 
Dnmnnoblot detection of CTB eaqpression in£L coU: N(»ispedfic antibod^ieactions were 
blocked by incubation of fhe membrane in 2Sml of S% non-£it dry milk in TBS buffer for 
S 1 -31irson a rotary shaker (40 rpm),foUowed by washing in TBSbnf^^ The 
membrane was thra inrabated for an hour with gentle agitation in 30 ml of a 1 : SOOO dihitton 
of rabbit ant»H;holeraantisenmi(SigmaC-3062) in TBS with Tween-20 [TBST| (contajning 
1% non-M dry inilk) foUowed by washing 3 tones in TBST bu£^^ The membrane was . 
incubated for an hour at room tsmpetataxc with gentle agitation in 30 ml of a 1:10000 

10 dilDtionofinouseantiH:abbitlg<jconjugatedwitha]]ffllinephospt^^ Itwastfaen 
washed thrice with TBST and once witfi TBS followed by incubation in the Alkaline 
Phosphatase Color Development Reagents, BdP/NBT in AP color devdopment Buffer 
(Bio-Rad, be.) for an hour, bntnunoblot analysis snows the presence of ll.S kDa 
polypeptide for purified bacterial CTB and tiansformed 24h/ 4Sh cultures (Fig. 3 A, lanes 2, 

15 3 and 5). The 48h culture appears to express more CTB than that of the 24h culture 
indicating the accumulation of flie CTB protein over time. The purified bacterial CTB (45 
Kda) dissociated into nK>nQmers (1 1 .5 KDa each) due to boiling prior to SDS PAGE. These 
results indicate that the pLD-LH-CTB vector is expressed in E, colL Because of the 
similarity of die E. coli protein synthetic machinery to that of chloroplasts^ cUoroplast 

20 expression of the above vector should be possible. 

CTB expression in chloroplasts: As described below» pUO-LH-CTB was integrated into 
the tobacco chloroplast genome via particle bombardment (Daniell, 1997). PGR analysis 
was performed to ccmfirm chloroplast integration. Fig. 3B shows i>rmi^ landing sites and 
size of expected products. PGR analysis of clones obtained aft^ the first round of selection 

25 was carried oiit as described below. PGR products were examined on 0.8% agarose gels. 
The PGR results (Fig. 3G) show that clones 1 and 5 that do not show any product are 
mutants while clones 2, 3, 4, 6, 7, 8, 9, 10 and 11 ttiat gave a 1.65 kbp product are 
transgenic. As expected, lanes 13 - 15 did not give any PGR product, confirming that die 
PGR reaction was not contaminated. Because primers 3P & 3M land on die aadA gene and 

30 on the chloroplast genome, all clones tiiat show PGR products have integrated die CTB gme 
and the selectable marker into die chloroplast genome. Clones that showed chloroplast 
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integration of Ihe CTB gene were moved to the second round of selection to increase copy 
nmnb^. PGR analysis of clones obtained aStar the second round of selection was also 
carried out PGR results shown in Fig. 3D indicate that clone 5 does not give a 3 kbp 
product indicating diat it is a niutant as observed earlier. Other clones give a strong 3 klq) 
5 product and a fiaint 1.3 kbp (similar to the 13 kbp untransformed plant product) product, 
indicating that they are ttat^^nic but iK>t yet homoplasmic. Complete homoplasmy can be 
accomplished by several more rounds of sdection or by geminating seeds firom transgenic 
plants on 500 figfcd of spectinomycixL 

CIB-ProinsalinVectorC^nstnicti^ lhe(MoroplastexpressionvectorpU><JIB^roh^ 

10 was constructed as follows* Firsts bob prainsulin and cholera toxin B-^bunitgmes were 
ampfified j&om suitable DNA using primer sequences. Primer 1 contains the GGAGG 
chloroplast preferred ribosome binding site five nucleotides iq>stream of the start codon 
(ATG) for the CTB gene and a suitable restriction enzyme site (SpeQ for insertion into the 
chloroplast vector. Prirner 2 eliniiimtes die stop codon arid adcb die first two amino acids- 

15 of a flexible hinge tetrapqptide GPGP as reported by Berg^t et al. (1997), in orda- to 
facilitate folding of the CTB-proinsulin fusion protein. Primer 3 adds die remaining two 
amino adds fortiie hinge tetta-peptide and eliininateslfaepre-seqnence of Ihepre-proinsuIirL 
Primer 4 adds a suitable restriction site (Spel) for subcloning into the chloroplast vector. 
Anq)lrBed PGR products were inserted into the TA cloning vector, Bodi the CTB and 

20 proinsulin PGR firagments were excised at die Smal and Xbal restriction sites. Bluted 
firagments were ligated into the TA cloning vector. Interestingly, all white colonies showed 
die wrong orientation for Cl B insert while three of the five blue coloiues examined showed 
the rigjbt orientation of the CTB insert The CTB-pioinsulin fiagmmt was excised at the 
EcoRl sites and inserted into EcoRl digested dephosphorolated pLD vector. Resultant 

25 onicroplast integration e?q)ression vector, pLD-CTB-Proins will be tested for expression ni 
E. coli by western blots. After confirmation of expression of CTB-protnsuIin fusion in E. 
coliy pLD-CTB-Proins will be bombarded into tobacco cells as described below. 
Optfanization of fusion gene expression: It has been reported that foreign genes are 
expressed between 5% (crylAC,cryllA) and 30% (uldA) in transgenic chloroplasts(DanieU, 

30 1999). If the e:?qnession levels of the CTB-Proinsulin orpolymer-proinsulin fusion proteins 
are low, several approaches will be used to enhance translation of diese proteins. In 
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cUoroplast, transcriptiaaal regulation of g^e expression is less infportant, aldiough some 
modolations by liglit and develc^ental condittons are observed (Cohen and Afoyfield, 
1997X KNAandprotdnstabilitya|)peartobelessinq>^^ 

accunadation of foreign proteins (e.g. GUS vp to 30% of total protein) and ^1 transcripts 
5 16^66-foldhig^dtan11ielug}ilye3q>re$sing nuclear tr^ Qiloroplast gene 

expression is regulated to a large extmt at ttiepost4ranscri^ For example, 5" 

UTRs are used for optional translation of dUoroplastm^ Sluno-DeIg9mo (GGAGG^ 
sequmces as well as a stem-loop structure located 5* adjacent to the SD sequence are used 
for effident translation. A recent stuify has shown ttiat insertion of the psbA 5' UTR 

10 downstream of the 16S tRNA promoter enhanced translation of a fordgn gene (GUS) 
hundred-fold (Bibl et al. 1999). Therefore, tibe 8S-bp tobacco chloroplast DNA firagment 
(1S9S - 1680) containing S' psbA UTR will be anq>lified using fhe following prhners 
cctttaaaaagccttccattttctattt, gccatggtaaaatct^ggtttatta. This PGR product will be inserted 
downstream of die 16S rSNA promoter to enhance translation of the proinsulin fusion 

15 proteins. 

Yet another approach for enhancement of translation is to optimize codon 
compositions of these fusion protein. Since both fusion proteins are ex|nressed well in £1 
coUf we expected efBcieat expression in chloroplasts. However^ optimiziag codon 
compositions of proinsulin and CTB goties to march fhe psbA gene could further enhance 

20 the level of translation. Although rbcL(RuBisCO) is fhe most abundant protein on earth,, 
it is not translated as frequendy as the pshA gene due to the extremely high turnover of die 
psbA gene product The psbA gene is under stronger selection for increased translation 
efSciency and is the most abundant thyiakoid protein. In addition, codon usage in hig^ 
plant chloroplasts is biased towards the NNC codon of 2-fold degen^te groups (i.e. TTC 

25 over TTT, GAC over GAT, CAC ovrar CAT, AAC over AAT, ATC over ATT, ATA etc.). 
This is in addition to a strong bias towards T at third position of 4-fold degenerate gjcoups. 
There is also a context effect that should be taken into consid^tion while modifying 
specific codons. The 2^fold degenerate sites immediately upstream fit)m a GNN codon do 
not show this bias towards NNC, (TTT GGA is preferred to TTC GGA while TTC CGT is 

30 preferredtoTTTCGTTTCAGTtoTITAGTandTTCTCrtoTTTTCr). Inaddition, 
Mg^yexpressedcMonsplastgqiesuseGNNmorefirequendythanotherg^^ Thewd>site 
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may be used qytimize codon conq)osition by comparing different ' 

^edes. Abundance of amino adds in chloioplasts can be taken into consideration 
(pathways con]3)arixnentaIiz6dinplasti^ 

S As far as die biopolymer gsne is concerned, we observed inconq)lete translation 

products in plastids when we eiq)ressed liie 12(hner^e(Gndaetal. 2000). Theiefore, 
while expressing Hio polym^-proinsulin fusion ptotein, we decreased die length of die 
polymer protem to 4Qmer, witfiout losing the thermal responsive property. In addition, 
optimal codons for ^ycine (GGT) and valine (GTA), which constitute 80% of die total 

10 amino adds of the polymer, have been used. In all nuclear encoded genes gfycine make vp 
147/1 000 amino adds i^^e in tobacco chloroplasts it is 129/1000. Hig^lyexpressinggenes 
likepsbA and ibcL of tobacco make np 192 and 190 gly/1000. Therefore, glydne may not 
be a limiting factor. Nuclear genes use 52/1000 proline as opposed to 42/1000 in 
diloroplasts. However, currcntlyusedcodonforproline(CCXj) can bemodifiedtoCCA or 

IS CCT to fiuiher enhance translation. It is known that padiways for proline and valine are 
compartmentalized in cbloroplasts (Guda et ai. 2000). Also, proline is known to accumulate 
in chloroplasts as an osmoprotectant (Danidl et al. 1994). 

Bombardmrat and R^enaration of Chloroplast Transgenic Plants: Tobacco {Nicotiana 
tabacum var. Petit Havana) and nicotine firee edible tobacco (LAMD 665,, gift £rom Dr. 

20 Keith Wycoff. Planet Biotechnology) plants are grown aseptically by germination of seeds 
onMSOmedrum. TBQs medium contains MS salts (4Jg/^tBr),B5 vitamin ini?itu^ 
inositDl, 100mg^liter;11iiamine-HCl. lOmg/liternicotinicacid. 1 mg/literjpyridoxine^HCL. 
1 mg/liter), sucrose (30 gi^liter) and phytagar (6 g/liter) at pH 5.8. Fully expanded, dark 
green leaves of about two month old plants are used for bombardment 

25 Leaves are placed abaxial side iq> on a Whatman No. 1 filter paper laying on die 

RMOP medium (Daniell, 1993) in standard petri plates (100x15 mm) for bombardment 
Tungsten (1 ^an) or Gold (0.6 /zm) micropiojectiles are coated with plasmid DNA 
(chloroplast vectors) and bombardments carried out with the biolistic device PDSIOOO/He 
(Bio-Rad) as described by Daniell (1997). Following bombardment, petri plates are sealed 

30 widi paraiilm and incubated at24^C under 12 hphotoperiod. Two days after bombardment, 
leaves are chopped into small pieces of '^S mm^ in size and placed on die selection medium 
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(RMOP cantajniDg SOOus/tnl olspcctisxamycin dihydrochloride) with abaxial side touchmg 
the medhnn m deep (100x25 nun) petri plates (-^10 pieces per plate). The Tegmerated 
q)ectmQmydn resistant shoots are dioppcd into small pieces (~2mra^ and sabcl(med into 
fresh deeppetriplatBs('*^5 pieces perplale) containing the sam Resistant 
S shoots fixmi ttie second culture cycle aifoe transferred to the rooting medium (MSO medium 
supplemented widi IBA. 1 mg^ter and spectmomycin dihydrochloride, 500 mg/Uter). 
Rooted plants are transferred to soil and grown at 26^C under continuous ligjbting conditions 
fiar further analysis. 

Potymerase Chain Reactimi: PGR is pafotmed using DNA solated fixmi control and 

10 trani^^c plants to distinguish a) true chloroplast tmnsformants fiom mutants and b) 
cUoroplasttransforniantsfix>m mcleartramforinants. Primos for testing the presence of 
the aadA gene (that coniers spectinomycln resistance) in transgenic pants are landed on the 
aadA coding sequence and 16S rRNA gene (primers IP&IM.). To test diloroplast 
integration of the insulin poe, one primo^ lands on the aadA gene, while another lands on 

IS &e iiative cUoroplast genonie (priiiieis 3P&3&40 as shown in Figs. 2A and 3^^ No PGR 
product is obtained with nuclear tnmsgenic plants using this set of primers. The primer set 
(2P & 2M^ in Figs. 2A and 3B) is used to test integration of the entire gene cassette without 
mtenial deletion or looping out during homologous recombination. A similar strategy has 
been used successfidly to confimi chloroplast integration of forngn genes (Daniell et aL, - 

20 1998;Kotaetal^ 1999; OndaetaL, 1999). This scroening is essential to eliminate mutants 
and nuclear transf ormants. 

Total DNA fiom unbombarded and transgemc plants is isolated as described by 
Edwards et al.^ (1991) to conduct PGR analyses hi transgraic plants. PGR reactions are 
p^ormed in a total volume of 50 iil containing approximately 10 ng of teQq>late DNA and 

25 I /iMof eachprhnerinamixtDreof 300;^ofea<^deaxymicle^^ 

Tris (pH 8.8), 100 mM KCI, 100 mM 0^)2804, 20 mM MgS04 , 1% Triton X-100, 1 
mg/ml nuclease-fiee BS A and 1 or 2 units of Taq Phis polymerase (Strat^ene^ La Jolla, 
CA). PGR is carried out in the Perkin Ehner's GeneAmp PGR system 2400, by subjecting 
the samples to 94°C for 5 min and 30 cycles of 94X for 1 mm, SS^'C for 1.5 min, 72*'G for 

30 1.5 or 2 min followed by a 72^C step for 7 min. PGR products are analyzed by 
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electn^horesis on 0.8% ag3T0se gels. Chloropldst transgenic plants ccnitaimng the 
proinsolm gene are ften moved to secxmd round of selection to adiieve homoplasmy. 
Soutliern Blot Analysis: Southemblotsaiepeifonnedtodetemai^ 
introduced foreign gene per cell as well as to test homoplasmy. There are several thousand 
copies oftiiechloroplast genome present in each plant cell. Therefore^ when foreign genes 
are inserted into ihe chloroplast genome, it is possible that some of 4ie (ddoroplast genomes 
have foreign genes integrated while others remain as die wild type (teteroplasmy). 
Therefore, to ensure diat only die transf oomed graome exists in cells of transgenic plants 
(homoplasmyX the selectian process is continued. To confirin that the wild type g^iome 
does not exist at the end of the selection cycle^ total DNA fiom transgenic plants should be 
probed widi the cUoroidast bcnrdcr (flanking sequences (the ttnl-tcnA fiagmenl; Figs* 2A 
andSB). If wild type genomes are present (heterc^lasmy), the 
obsm^ along with transformed glomes. Presence of a large fiagment (doe to insertion 
of foreign genes within ttie flanking sequences) and absence of the native small ftagment 
confirms homoplasn]^ CDaniell et al.» 1998; Kota et al., 1999; Guda et aL, 1999). 

The copy number of the mtegrated gene is determined by establishmg homqplasmy 
form die transgenic chloroplast gmome. Tobacco chloroplasts contain 5000-10,000 copies 
of dieir genome per cell (Daniell et al., 1998). If only a fraction of the genomes are actually 
trai^formed, the copy nmnber, by defsudt, must be less than 10,000. By estabfishing diat in 
die trangenics the insulin inserted transformed genome is the only one present, one can 
establish that the copy number is S000~10,000 per cdL This is usually achieved by 
digesting the total DNA widi a suitable restriction enzyme and probing with the flanking 
sequences that enable homologous recombination into the chloroplast graome. The native 
fragment present in the control should be absent in the transgenics. The absence of native 
fragment proves that only the transgmc chloroplast genome is preset in the cell and ttiere 
is no native, untransformed, chloroplast genome, without the insulin gene present This 
establishes the homoplasmic nature of the transformants, simultaneously, diereby providing 
an estimate of 5000^10,000 copies of the foreign genes per cell. 

Total DNA is extracted from leaves of transformed and wild type plants using the 
CTAB procedure outlined by Rogers and Bendich (1988). Total DNA is digested with 
suitable restriction fflzymes, electr(q;>horesed on 0.7% agarose gels and transferred to nylon 
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membranes (Micron Separation Inc., Westboro, MA). Probes are labded with ^-dCTP 
using fte landomrprimed procedure (Promega). Fro-hybridization and hyfaridizatiQn stq>s 
are carried out at 42^ for 2 h and 16 h» respectively. Blots arc soaked in a sohition 
containing 2X SSC and 0,5% SDS for 5 win followed by transfix to 2X SSC and 0. 1% SDS 
solution for IS min at room tempearature. Then, blots are incubated in hybridization botdes 
containing O.IX SSC and 0.5% SDS sotaition for 30 nun at 3TC followed by anodi^r step 
at eS^'C for 30 nun» with gende agitation. Finally* blots are briefly rinsed in O.IX SSC 
sohition, dri ed and exposed to X-ray film in the daik. 

NorflimiBIot Anatysis: Nordiemblotsareperforinedtotest1faeefBden<^oftranscrqrtion 
of the prDirisulingp:ie fused with CTB or polyinergene^ Total KNA is isolated fixnn 150 
mg of fix>zen leaves by usuig (he '^leasy Plant Total RNA Isolation Kif (Qiagen Inc., 
Chatsworth, CA). RNA (10- 40 ing) is denatured by foimalddxyde treatment, separate 
a 1.2% agarose gel m ^ presence of formaldehyde and transferred to a nitrocellulose 
membrane (MSQ as described in Sambrook et al. (1989)* Probe DNA Qproinsulin gene 
coding r^on) is labeled by the random-primed method (Promega) with ^-dCT isotope. 
The blot is pre-hybiidized, hybridized and washed as described above for soufhem blot 
analysis. Transcrq>t levels are quantified by the Molecular Analyst Program using the GS- 
700 Imaging Deositometar (Bio-Rad, Hercules, OA). 

Pol^mer*insullnfiiisionprotdnpurification,quanfitationandch^ Because 
polymer insulin fusion proteins ediibif inverse temperature transition properties as shown 
in Figs. 1 A and B, Ihey are purified firom transgenic plants essentially following the same 
method for polymer purification fix)m transgenic tobacco plants (Zhang et aL, 1996). 
However, an additional step is mtrodaced to take advantage of die compartmentalization of 
iiisulinpolymerfusionproteinwithinchloroplasts. Ghloroplastsarefirstisolatedfixancrude 
homogenate of leaves by a simple centrifugation step at ISOOXg. This eliminates most of 
die cellular organelles andproteins(Daniell at al., 1983, 1986). Then, cUoroplasts axe burst 
open by resuspending them in a hypotonic buffer (osmotic shock). This is a significant 
advantage because there are fewer soluble proteins inside chloroplasts when compared to 
hundreds of soluble protdns in the cytosoL Polymer extraction buffer contains 50 mM Tris- 
HCl, pH 7.5, 1% 2-mecaptoeflianol SmM BDTA and 2mM PMSF and 0.8 M NaCL The 
homogenate is then c^itrifuged at 10,000 g for 10 min (4^C)> and the pellet discarded. The 
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supernatant is incubated at for 30 minutes and tiien cmtrifbged immediately for 3 
minutes at 5,000 g (room ten]p«:atiire). Ifinsnltnisfound1obesettsitivetofliistenq>eratu^ 
Tt is lowered by increasing salt concentration (McPherson et al., 1996). Tbe pellet 
containing the insnlin^lymer fusion protein is resuqiended in tbe extraction buffer and 
incubated on ice for 10 minutes. Hie mixture is centrifuged at 12^000 g for 10 minute (4°C), 
The sqiematant is then collected and stored at -20^C. The purified polym^ insulin fiision- 
projtein b electrppboresed in 'a SDS-PAGE gd according to Laemml (1970) and visualized 
by eidier staining wift 0.3 M CuCl^ (L* ^ aL> 1987) or transferred to nitrocellulose 
membrane and probed wifli antiserum raised against tbe polymer or insulin protein as 
described below. Quantification ofpurified polymer proteiim may then be c^ 
densitometry. 

After electrophoresis, protdns axe transfened to a nitroceltulose meoihrane 
dectropboreticanyin25mMTris» 192mMgtyc]ne»5%nietfam^^ Thefilteris 
blocked with 2% dry milk in Tris-buffered saline for two bouts at room temperature and 
stained with antiserum raised agafaist the polymer AVGVP (kindly provided by ftie 
Umversity of Alabama at Birmingham, monoclonal facility) overnight in 2% dry milk/Tris 
buffered saline. The protein bands reacting to the antibodies are visualized using alkaline 
phosphatase-linked secondary antibody and the substrates nitroblue tetrazolium and S- 
bromo-4~chIoro-3-indolyl-phosphate (Bio-Ra^. Alternatively, for insulin-polymer fusion 
proteins, a Mouse anti-human proinsulin (IgGl) monoclonal antibody is used as a primary 
antibody. To detect the binding of the primary antibody to the recoihbinant proinsulin, a 
Goat anti-mouse IgG Horseradish Peroxidase Labeled monoclonal antibody (HPR) is used 
ThesubstrateusedforcoigugationwitfaHPRis3^VS,S*-Tetramet^ Allproducts 
are available from American Qualex Antibodies, San Qemente, CA. As apositive control, - 
human recombinant proinsulin from Sigma may be used This human recombinant 
proirisulin was «qxressedin£co/i by a synthetic proiiisulingm^ Quantification of purified 
polymer fusion proteins is carried out by d^isitometry using Scanning Analysis software 
(BioSoft, Ferguson, MO) installed on a Macintosh LC in computer (Apple Computer, 
Cupertino, USA) with a 1 60-Mb hard disk operating on a Sj^em 7. 1 , connected by SCSI 
int^ace to a Relisys RELI 2412 Scanner (ReUsys, Milpitas^ CA). Total protein contents is 
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then detemiined by Ihe dye-binding assay using reag^ts suf^fied in kit fio Bio-Kad, wifh 
bovine s&cvm albumin as a standard 

Characterizatton of CTB expression: CTB protein levels in transgenic plants are 
detennined using quantitative EUS A assays. A standard curve is geneiatBd using known 
concentrations of bacterial CTB. A 96-weU mictotiter plate padded vnth 100 A^lAvell of 
bacterial CTB (concentratLons intfaerangs of 10 - 1000 ng) is incubated ovemigjit at 4X. 
The plate is washed fldce wiQi PBST (j^s^iab^ buffered saline containing 0.05% Tweenr 
20). Ibe badcground is blocked by iiu:ubation in 1% bovine 

(3001MelQat37Cfor2lLfoUowedbywashing3timesvrithPBST. Theplate is incubated 
ina 1:8,000 dihition of rabbit antiH^loca toxin antibody (100/^eIQfor 
2 hat 37^ foUowed by washing file wells three ttinesviath PBST. The plate is incubated 
wiflia l:80,000d]liitionofantiHrabbitIgG conjugated with al^^ 
for2hat37XandwashedtfaricewifliPBST. Then, 100 Ml alkaline phosphatase substrate 
(Sigma Fast p-nitrophen^ phosphate tablet in 5 nd of water is added and the reaction 
stopped wifh IM NaOH (50 ^VwoU) when absorbancies in the mid-rangp of ttie titration 
reach about 2.0, or after 1 hour, whichever conies first The plate is 1h» read at 40Snm. 
These results are used to generate a standard curve fiom which concentrations of plant 
protein can be extrtqpolated. Thus, total soluble plant protein (concentration previously 
detsnnined usmg the Bradford assay) in bicaibonate buffer, pH 9.6 (15 nM Na2Co3, 35niM 
NaHCOj) is loaded at 1 00 plant /il/well and &e same procedure as above can be repeated. 
The absorbance values are used to determine the ratio of CIB protein to total soluble plant 
protein, using the standard curve generated previously and the Bradford ass^ results. 
Inheritance of Introduced Fore^ Genes: In initial tobacco transfixmants, some are 
allowed to self-pollinat^ whereas otiiiers are used in reciprocal crosses widi control tobacco 
(transgenics as female acceptors and pollen donors: testing for maternal inheritance). 
Ilarvested seeds (Tl) are germinated on media containing spectinomyciiL Achievement of 
homoplasmy and mode of inheritance can be classified by looking at gennination results. 
Homoplasmy is indicatedby totally green seedlings (Daniell etal., 1998) while heteroplasmy 
is displayed by variegated leaves (lack of pigmentation, Svab & Maliga, 1993). Lack of 
variation in chlorophyll pigmentation among progeny also underscores tiie absence of 
position effect, an artifact of nuclear transformation. Matanal inheritance may be 
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demonstrated by scie transmission of introduced genes via seed generated on transgenic 
plants, leganUess of poUen soiin^ (green seedlings on Whentransg^c 
poUen is used for polfinatton of control plants, resultant progeny does not contain resistance 
to chemical in selecthre media (will appear bleached; Svab and Molecular 
5 analyses confirms transmission and expression of introduced genes, and T2 seed is generated 
finm Aose confirmed plants by the analyses described abova 

Comparison of Corrent Purification with Polymer-based PurificatioB M ettiods: It is 

inQX)rtant to compare puiificatian mediods to test yield and pnrity of insulin produced in R 
cott and tobacco. One liter of pSBL containing bacteria is grown m LB/anpiciBin (100 

10 . /ig^) overnight and the fusion protem expressed are harvested by centriiiig^on at - 
5000 X g for 10 min at 4^C» and tiie bacterial pellets resuspended in 5 ml/g (wet wt 
Bacteria) oflOOmMTns-HCl,pH 7.3. Lysozyme is added at a concentration of 1 mg^ 
and placed on a rotatmg shaker at room tenq[>»:atare for IS n^ The lysate is subjected to 
probe sonication for two cycles of 30 s an/30s off at A'^C. Cellular debris is removed by 

IS ' centrifugationat 1000XgforSininat4^C bisulin polymer fiision protein is purified by 
inverse tenq)erature transition properties (Daniell et al., 1997). Alternatively, the fusion 
protdn is piuified according to Cowl^aodMacldn (1997). Hie siq)ecnatant is retained and 
centrifuged again at 27000 X g for IS min at 4^C to pellet the inclusion bodies. The 
superoatantisdiscardedandtiiepell^resuspendedin 1 ml/g (original wt Bacteria) of dH20, 

20 aliquoted into microcentrifuge tubes as 1 ml fiactions» and &ai centrifuged at 16000Xg£>r 
5minat4*'C The peUets are individually washed widi 1ml of 100 mMTri^O^pH 
1 M urea, 1 - 1 Triton X-1 00 and again washed with 1 0 0 mM Tris HQ pH8.S, 2 M urea, 2% 
Trinton X-100. The pellets are resuspended in 1 ml of dHaO and transferred to a pro- 
weighted 30 ml Corex centrifuge tube. The sample is centrifuged at ISOOO X g for S mm at 

2S 4*'Q and the pellet resui^ended in 10 ml/g (wet wt pellet) of 70% formic add Cyanogen 
bromide is added to a final concentration of 400 mM and the sample incubated at room 
tenqieiature in the dark for 16 h. The reaction is stopped by transferring the sample to a 
round bottom flask and removing the solvent by rotary evaporation at SO^C The residue is 
resuspraded in 20 ml/g (wet wt pellet) of dH20, shell firozen in a dry ice ethanol bath, and 

30 thenlyophilized. The lyophilized protein is di^lved in 20 ml/g (wet wtpellrt) of 500 rnM 
Tris-HCl, pH 8^ 7 M urea. Oxidative sulfitolysis is performed by adding sodium sulfite 
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and sodium tetratliionate to final concentrations of 100 and 10 noM, tespectiyely; and 
incubating at]XM>mtenq)eratarefor3 This reaction is tiim stopped by freezing on dry ice. 

PnrificatioD and folding of Hmnan Proinsnlin: The S-sulfonated material is qypUed to a 
5 2 ml bed of Sq>badexG-2S equilibrated in 20 mMTris-HCl,pH 8 

wasfaedwitii9volsof7Murea. ThiecollectedfiractionislfaenqypliedtoaPhaimaciaMono 
Q HR S/S column equilibrated in 20 mM Tris-HC^ 

ml/min. A linear gradient leading to find concei£:ationof03MNaQ is used 

bound matoiaL 2 min (2 ml) finctions are collected during tiie gradient and proton 

10 concentration in each fi:action detennined. Purity and molecular mass of firactions are 
estimated by Tridne SDS-PAGE (as shown in Fig. 2), where Tricihe is used as die trailing 
iontoallowbetterresohitionofpeptidesinfherangeof 1*1000 kDa, Appropriate firactions 
are pooled and ^lied to a 1.6 X 20 cm column of Sq)hadex G-2S (superfine) equilibrated 
in 5 mM ammonium acetate pH 6.8. The sanqile is collected based on UVabsorbance and 

IS fireeze-dried The partiEdly purified S-sutfonated material is resuspmded in SO mM 
glycine/NaOH, pH 10.5 at a final concentration of 2 mg/ml. P-mer-captoethanol is added - 
ataratio of 1.5 molp^mol of cysteine S-sulfonate and the sample stirred at 4^C in an open 
containerfor 16h. The sample is then analyzed by rcYersed-phasebigh-perfoniiaiic 
cfaromatogr^hy (RP-HPLQ using a Vydac C4 column (2.2 X 150 mm) equilibrated in 4% 

20 acetDnitrileaiid0.1%TFA. Adsorbedpq[)tides are elutedwitii a Unear gradient of increasing 
acetonitrite concentration (0.88% per min iq> to a mairimiim of 48%). The remaining 
refolded proinsulin are centrifoged at 16000 X g to remove insoluble material, and loaded 
onto a semi-preparatiye Vydad C4 column (10 X 250 mm). The bound material is then 
eluted as described above, and the proinsulin collected and lyophilized. 

25 Analysis and characterization of insulin e^qpressed in K coU and Tobacco: The purified 
expressedproinsulinissubjectedtoinatrix-assistedlasardesorptioii^onizatiori^^ 
(MALDI-TO?) analysis (as described by Cowley and Macldn, 1997), ushig proinsulin fixm 
EU Lilly as both an internal and external standard A proteolytic digestion is performed 
using Staphylococcus aureus protease V8 to determine if the disulfide bridges have formed 

30 correctly naturally inside chloroplasts or by w W/no processing. Frsre (ig of both the 
expressed proinsulin and EU Lilly* s proinsulin are lyophitized and resuspended in 50 a^I of 
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256]]iMNaP04 pH7.8. Protease V8 is added at a ratio of 1:50 (w/w) m e3q>e^ 
sanq^les and no eo2yme added to the controls. AU saiiqyles are then inoibated overnight at 
the reactions stoi^ed by fieezmg on dry ice, and sanq)les stored at -20X until 
analyzed. The san^)lesai£ analyzed by RPrEIPIX]!using a VydacQ 
5 ^ equilihratedm4%ac^onitriteand0.1%TFA. Bound material is tfieneluted using a linear 
. gradient of increasing acetonilrile conc^itration (0.88% per min up to a maximum of 48%). 

CrB-€a\ilg9Jiglioside binding assay: A GMl-ELISAassayisperfonned as described by 
Axakawaetal. (1997) to d^ennine the afBnity of plant-derived CTB for(941-g^ngUoside. 

10 The miczotiter plate is coated with nionosialogan£^osic&-GMl (Sigma G-7641) by 
incubating the plate with 100 ^l/well of GMl (3.0 Mg/ml) in bicarbonate buffer, pH 9.6 at 
4''C oveimgbt Altemativety, the wells are coated with 100 /zl/well of 6SA (3.0 //g^) as 
controL The plates are mcubated with transforoied plant total so^ 
CTB (Sigma C-9903) in PBS (100 lil/well) overnight at 4%L The remainder of flie 

15 procedure is then identical to the ELISA described abova 

Mouse feeding assays for CTB: This is performed as described by Haq et al. (1995). 
BALB/c mice, divided into groups of five animals each, are &sted overnight before feeding 
them transformed edible tobacco (that tastes like spinadi) expressing CTB, untransformed 
edible tobacco and purified bacterial CTB. Feedings are performed at weekly intervals (0, 

20 7, 14 days) for dnee weeks. Aniinals are observed to confirm complete consumption of 
material. On day 20, fecal and serum sanq[)les are coUectedfirom each aiiimal fox ana^ 
of anti-CTB antibodies. Mice are bled retro-oibitally and the samples stored at "20^C until 
assayed. Fecal sanq^les are collected and fiozen ovenug^ at -70^C, lyophilized, 
resuspended m 0.8 ml PBS (pH72) containing 0.05% sodium azide per 15 fecal pellets, 

25 centrifiiged at 1400Kg for 5 min and the supematant stored at -ZO'^C until assays Sanq)les 
are then serially dilute in PBS containing 0.05% Tween-20 (PBST) and assayed for anti- 
CTB IgG in SGVsm and anti-CTB IgA in fecal i)ellets by the ELISA method, as described 
earlier. 

30 Assessment of diabetic ^^ptoms in NOD mice: The incidence of diabetic synoptoms is 
con]pared among mice fed with control nicotine fiee edible tobacco and diose that express 
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tJieCTB-proinsulinfusimproteixL Four week old female NOD ixdce are divided 
groups, each group consistiDg often mioe. Each groiq> is fed with control or transgenic 
edible tobacco (nicotine firee) esqxicssing the CTB-proinsulin fusion gene. The feeding 
dosage is deteamoined based on the level of expression. Starting at lOwedcsofage, the mice 
are monitore^ on a biweeldy basis with urinary ghicose test strips (Clinistix and Diastix, 
Bayer) for development of diabetes. GlycosuricmicearebledfiomtfaetailveintDdiedcfQr 
glycCTiia using a ghicose analyzer (Accu-Cabieck, Bodnmg^ \fannheim). Diabetes is 
confinned by hyperglycmiia (>250 mg/dl) for two consecutive we^ (Ma ^ aL^ 1 997). 
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EXPRESSIONOF THE NATIVE CHOLERA TOXIN B SUBTJNIT GENE 
AS OUGOMERS IN TOANSGJENIC TOBACCO CHLOROPLASTS 

FIELD OF THE INVENTION 

This inv^tioii rdafes to expression of native cholera toxin B subnnit gene as ' 
oli^mieis in transgeaic plant chlotoplasts^ particulaily, in transgmc tobacco cbJonqrfasts. 

BACKGROUND 

Phannacologically inqrartant proteins are increasingly being expressed in plants as 
an economical attotnative to convmtional protein production m^ods. Transgpnk plants 
expressing recombinant proteins and biologically active peptides sncb as vaccines, growth 
fectors, hormones, monoclonal antibodies and enzymes have been rqx>rted (1). 

Proteins from difTerant sources of a wide range have been expressed in nucleiar 
transgenic plants. Protein accumnlaticm levels of recombinant enzymes, like pbytase and 
xylanase were Ug^ in nuclear transg^c plants (14% and 4.1% of total soluble tobacco leaf 
protein respectively). Thismaybebecausetheirenzyintatictiatureniiadediemmoreresistant 
to proteolytic degracbttoiL Mostrmdeartraiisgenicplants express low levelsofiecondniiant' 
protein offauman» viral or bacterial origni. Human protefns are cfxpreg«ed at levpJs Tanging 
firom as low as 0.000017% of fieahwei^ (human 0 interferon expressed in tobacco) to a 
higji of 0. 1% of soluble seed protein (human enkq)halin ^pressed in aiabidopsis seeds; 2). 
The NcMrwalk virus cap^ protein expressed in potatoes caused oral immunization when 
consumed, although food oqnression levels were low, maximizing at 03% of total soluble 
protein (3). 

SUMMARY OF THE INVENTION 

This invention includes expression of native chol^ toxin B subunit gene as 
oligomers in transgenic tobacco diloroplastswhiclimay be utilized in connection witii large- 
scale production of purified CTB, as well as an edible vaccine if expressed in an edible plant 
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or as a transmocosal cacri^ of peptides to vMdk it is fused to eilber enhance mucosal 
immunity or to induce oral tolerance of fhe products of these peptides. 

BRIEF DESCRIPTION OF THE DRAWINGS 

5 Fig. 1. pLD-LH-CTB vector and PGR analysis of control and chloroplast 

transformants. A. The perpendicular dotted tine shows the vector sequences that are 
homologous to native chloroplast DNA, resultmg in homologous recombination and site 
specific integration of the gene cassette into tiie chloroplast genome. Primer landing sites 
are also shown. B. PGR analysis: 0.8% agarose gel ofPCR products usmg total plant DNA 

10 as tenq>late. 1 kb ladder (lane 1); Untransformed plant (lane 2); PGR products with DNA 
tenqplate firom transgenic lines 1-10 (lanes 3 

Fig. 2. Western blot analysis of CTB ^qpiession in E.coli and chloioplasts. 
Blots were detected using rabbit anti-cholera serum as primary antibody and alkaline 
phos[^taselabeledmouseanti-rabbitIgGassecondaryantibody. A. E.coliprotein analysis: 

IS Purified bacterial CTB, boiled (lano 1); Unboiled 24 h and 48 h transformed (lanes 2 & 4) 
and untransformed (lanes 3 & S) B. coli cell extracts. Plant protein analysis: B. Color 
Development detection: Boiled, untransformed protein Qano 1); Boiled, purified CTB 
antigen (Im^b 2): Boiled^ protein from 4 different transgenic lines (lanes 3 - 6). C. 
ChemiTuminescent detection: Plant protein- Untransformed, unboiled (lane 1); 

20 Untraiisfoimed,boiled(lane2);Transg^cliiies3&7»boiled(la^ 

3, unboiled (lane 4); Purified CTB anti^n boiled (lane 6), unboiled 0ane 7); Marker (lane 
8X 

Fig. 3. SoulhCT blot ana^is of Tq and T] plants. A. Untransformed and • 
transformed chloroplast genome: TransformedanduntransformedplantDNA was digested 
25 with Bgin and hybridized with the 0.81 kb inx>be diat contained the chloroplast flanking 
sequences used fear homologous recombmation. Southern Blot results of To lines (B) 
Untransformed plant DNA (lane 1); Transformed lines DNA (lanes 2-4) and Ti lines (C) 
Transformed plant DNA Qsam 1-4) and Untransformed plant DNA (lane 5). 

Fig. 4. A. Plant phenotypes; 1 : Confirmed transgenic Kne 7; 2: Untransformed 
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plant B. 10-day-old seedliqgs of T, transfoimed (1, 2 & 3) and untransfomied plant (4) 
plated on S(Khng^ spectinoniycm selec^on med^^ 

Fig. 5. A. CTB ELIS A quantification: Absorbanceof CTB-antibodty con^Iex 
inknown concentrations of total sotubleplantprotem was compared to absorbance ofknown 
5 concentiation of bacterial CIB-antibody conqilex and the amount of CTB was expressed as 
a percentage of tibe total soluble plant protenL Total soluble plant piotein from young, 
mature and old leaves of transgenic lines 3 and 7 was quantified* B. CTBGM iGanglioside 
binding ELISA assays: Plates coated first with GMj gangliosides and BSA req>ectively, 
we^ plated with total soluble plant protein fiom lines 3 and 7, untransfonned plant total 
10 soluble protein andpurified bact^al CTB and the absorbance of the GMigangUoside-CTB- 
antibody con^leK in each case was measured. 

DETAILED DESCRIPTION 
Bacterial antigens like the B subunit proteins, CTB and LTB, which are two 

IS di^cally, structurally and inmmnologicaily similar candidate vaccine antigens of 
prokaryotic enterotoxinsy have hem expressed in plants. CTB is a candidate oral subunit 
vaccine for cholera tiiat causes acute wat^ dianhoea by colonizing the small intestine and 
produdngtiieenterotoxtn, cholera toxm (CI). Cholera toxin is a hexamericABs protein 
consistmg of one toxic 27 kDa A subunit having ADP ribos^ transferase activity and a 

20 nontoxicpentamerof lL61d[>aBsQibunit5(CTB)tiiatbindsto j^dlitates 
its entry into tiie intestinal qntfadial cells. CTB when administered orally is a potent 
mucosal immunog^ wMdi can neutralize the toxicity of the CI holotoxm by preventing 
it fixnn binding to the intestinal cells (4). This is believed to be a result of it binding to 
eukaryotic cdl surfaces via Gmi gangliosides, receptors present on the uxtestuial epithelial 

25 sur&ce, eliciting a mucosal immune response to pathogens and enhancing the unmune 
response ii^en chemically coupled to otiier antigens (5,6) 

Native CTB and LTB genes have been expressed at low levels via the plant nucleus. 
Since, both CTB and LTB are AT-rich con^)ared to plant nuclear genes, low expression was . 
probably due to a number of factors such as aberrant mRNA splicing, mRNA instability or 
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. inefiSd eat codon usage. To avoid these undesirable features synthetic '^plant optizmzed" 
genes encoding LTB were created and expressed in potato, lesnttmg in potato tubers 
expressing up to 10-20 AAgofLTB per grain firesh weight (7). However, extensive codon 
modification of gmes is laborious, expensive and often not available due to patmt 
restrictions. One of the consequences of these constitutively expressed high LTB levels, was 
the stunted growth of transgenic plants that was eventually overcome by tissue specific 
expression in potato tubers. The mnTOmiTn amount of CTB protein detected in auxin 
induced, nuclear transgenic potato leaf tissues was ^^proximately 03% of &e total soluble 
leaf protein wh^ the native CTB gaie was fiised to an endoplasmic reticulum retenticm 
signal, Hhm targeting the protein to the endoplasmic reticulum for accunmlation and 
assembly (8). 

Bicreased expression levels of several proteins have been attained by e^iressuig 
foreign proteins in chloroplasts of hi^er plants (9 - 11). Human somatotcopin has been 
expressed in diloroplasts widi yields of 7% of the total soluble protem (12). The 
accumulation levels of tiie Bt Ciy2Aa2 op&xm in tobacco chloroplasts are as high as 46.1 
%of the total soluble plant prot^(l 3). This higjb level of e^qxression is attributed to the 
putative db^eroningyOrfl and orf2» upstream of Qry2Aa2 in Ibeoperon that nwyl^ 
fold the protein into a caystaUme fomi that is stable and resistant to proteolytic degiadatimL 
Besides the ability to express polycistrons, yet another advantage of diloroplast 
transformatiQn I, is the lack of lecanibinant protein expression in pollen of cUoroplast 
transgenic plants. As there is no (UoroidastDNA in pollen ofmost crops, pollen inediated 
oulcross of recombinant genes into the egyironment is minimized (10 - 15). 

Since the transcriptional and translational machine of plastids is prokaryotic in 
origin and the AT. ^oAoccum (Mirq[>last genome has 62.2% AT contend it 
native CTB genes would be efiBciently ^qpocessed in this organelle wiflioat&eneed for codon 
modification. Also, codon comparison of the CTB gene with psbA, the ms^ot translation 
product of fte chloroplast, showed 47% homology with the most firequent codons of the 
psbA gene. Highly expressed plastid g^es display a codon adaptation, which is defined as 
a bias towards a set of codons which are complimentary to abundant tEtNAs (16). Codon 
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analysis showed that 34% of the codons of GIB are complhnentaiy to ttie tRNA population 
inthecUoraplastsinconqKirisonwittiSl %ofpsbAcod(His thataceconxplimeDtarytotfae 
chloioplast tiRNA population. 

Also, stable incorpoTation of tbe CTB gene into tfie ptecise location l>etween the tmA 
and tml genes of the chloioplast genome by homologous recombination, should eliminate 
Ihe ^position effect^ firequenfly observed in nuclear transgenic plants. TTiis should allow 
uniform expression levels in diff^nt tiansgienic lines. Amplification of die 
transgene, should result in a high level of CTB gene expressian since each plant cell contains 
up to 50,000 copies of flie plastid g^ocne (17). Another significant advantage of the 
production of CTB in chloroplasts, is the ability of chloroplasts to form disulfide bridges 
(12,18,19) which are necessary for die correct folding and assembly of the CTB pentam^ 
(20). 

Ih this study, we report the integration of the CTB gene into the inverted vepeat region 
of the tobacco chloroplast genome, allowmg 2 copies/ chloroplast g^meof the CTB gene 
per cell, resultirig in chloniplastsaccuinulating high levels This eliminates die need 

to modify the CTB gene for qptimal esqiression in plants. 

CdnstracdonoftheCUoropIast Expression Vector pIA-CTB: The leader sequence (63 
bp) of the native CTB ^e was deleted and a start codon was introduced at the 5* end 
PrimessweiodedgnedtoiritroduceBnrbssiteSbasesupstreamoftfaestartco^ TheCTB 
PCK.productwasdienclonedinlofteinuItq>lecloningsiteoftte vectDr(&ivitrogen) 
and subsequmdy into tibe chloroplast esqiression vector pLD-CtV2 using suitable restriction 
sites. Kestriction enzyme dig^ons of Ihe pLD-LH- CTB vectcnr were done to confirm the 
correct orientation of Ihe inserted fi^gment 

Expression of diepIJ>-IJI-CTBvectorwas tested mJ?. co&XLr-l Blue MRF^ strain 
before tobacco transformadon. E. coU was transformed by standard CaCl; transfimnation 
procedures. Transformed £ coli Q4 and 48 hrs culture in 100ml TB with 100 /zg/ml 
anqpicillin) and untransformed E, coli (24 and 48 hrs culture in 100 ml TB with 12.5 /zg/ml 
tetracycline) were centrifiiged for 15 min. The pellet obtained was washed with 200mM 
Tris-Cl twice, resuspended in 500 /A extraction buffer (200mM Tris-Cl, pH 
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8.0, lOOmM Nad, IQmM EDTA, 2mM PMSF) and somcated To aliquots of 100 fil 
transfinmed anduntransfonned sonicates [contaioing SO - 100 fig of crude protem extract 
as (fetemuned by Bradford protem assay (Bio-rad)] and ptaified CTB (100 ng, Sigma), 2X 
SDS sample buffer was added. These sample mixtures were loaded on a 15% sodiinnSDS 
5 -PAGE gel and electrophoiesed at 200v for 45 min. in Tris-glycine buffer (25mM Tris, 250 
mM glycine, pH 83, 0.1% SDS). The separated protein was transfeired to a nitrocellulose 
membrane by electroblotting at 70v for 90 

Immiinoblot Analy^ of CTB Production in £1 colii Nonspecific antibody reac^ons w^ 
blocked by incubation of the m^nbrane in 25 ml of 5% non-fat dry milk in TBS buffer for 

10 2 h on a rotary shaker (40 rpm) followed by washing in TBS buffer for 5 min. The 
membrane was incubated for Ih in 30 ml of a 1 :5000 dilution of rabbit anti-cholera antiserum 
(Sigma) in TBST (TBS with 0.05% Tween-20), containing 1% non-fat dry milk, followed 
by washing thrice in TBST. jbicubation for an hour at room ten^>eratiire in 30 ml of a 
1:10,000 dilution of alkaline pIioslq)hatase ccmjugated mouse anti-rabbit IgG. (Sigma) in 

IS TBST» wasdiing thrice in TBST and once with TBS was followed by incubation in tiie 
Alkaline Pho^hatase Color Devdopment Reagents, BCnVNBT ui AP color developmrat 
buffer (Bio-Rad) for an hour. 

BombardbnentandRegenerationofCUoroplastTransgenicPlants: Fully expanded^daik 
green leaves of about two-monOi old Nicotiana tabacum var. Petit havana plants were 

20 placed abaxial side iQ) on filt^ pq)ers in RMOP (21) p^didi plat^ Micrqnrqjeetaes 
coated with pLD-LH-CTB DNA were bombarded into the leaves using the biolistic device 
PDS1000/He(Bio-Rad), as described by Daniell (21). Followu3ginciibationat24''Chitiie 
dadc for two days, the bombarded leaves woie cut into small (-5mm^ pieces and placed 
abaxial side up (5 pieces/^Iale) on selection medium (RMOP containing 500 vas/L 

25 spectinomycin dihydrochloride). Spectinomycin resistant shoots obtained after about 1-2 
months were cut into small pieces (~2mni^ and placed on the same selection medium. 
PCR Analysis: Total plant DNA from putative transgenic and untransformed plants was 
isolated using the DNeasy kit (Qiagen). PCR primers 3P (5'AAAACCCGT(XTCAGT 
TCGGATTGC-S") and 3M (5'-CCGCGTTGTrrCATCAAGCCTrACG-30 were used for 
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FCR on putative transgenic and untrax^fonned plant total DNA. Samples were camed 
through 30 cycles using the following temperature sequence: 94**C for 1 niin, 62*C for 
l.Sininand72Xfor2nun. C^les were preceded by denaturotion for 5 min at 94''C. PGR 
confirmed shoots fiom the second selection were transferred to rooting medium (NfSO 
5 medium coiitaining 500 mg/L spectinomycin). 

Southern Blot Analysis: Tenmicrogramsoftotal plant DNA (isolated using DKeasy kit) 
per saiiq>le were digested with Bgl U, sq)arated on a 0.7% agarose gel and transfened to a 
nylon membrane. A0.8]d>firagmentprobe, homologous to the diloroplastbord^ sequences, 
was generated when vector DNA was digested wiABgin and BamHI. Hybridization was 
10 performed using flie Ready To Go protocol (Pharmacia). Southern blot confirmed plants 
w^ transferred to pots. On flowering, seeds obtained bom Tq lines were germinated on 
spectinomycin dihydrochloride-MSO media and Tj seedlings were grown in bottles 
containing MSO with spectinomycin (500 mg/L) for 2 weeks. The plants were later 
transfored to pots. 

15 Western Blot Analysis of PlantProtein: Transformed anduntransformed leaves (100 n^) 
were ground in liquid nitrogm and resospended in 500 /il of extraction buffer (200mM Tris- 
a, pH8.0. 100 mM Nad, lOmM EDTA, 2 mM PMSF). Leaf extracts (100 - 120 fzg as 
determined by Lowry assay) were boiled (4 min) and unboiled in reducing sano^ bufifer 
^ioRad) and electrophoresed in 12% polyacrylamide gels using &e buffer system of 

20 LaCTimli(22). The sqntated proteim were transferred to a nitrocellulose nie^ 

electr6blottingat85vforlh. The inmmnoblot detection procedure was similar to lhat done 
for £ coH blots described above. For tlie chemihimmescmt detection, the S. Tag™ AP 
Lumiblot kit (Novagen) was used. 

EUSA Quantification of CTB: I>iSemA concentrations (100 /il/well) of 100 mg leaves 
25 (transformed and untransfonned plants) ground with liquid nitrogm and resuspeoded in 
bicarbonate buflfer, pH 9.6 (15mM NajCOj, 35mM NaHCOj) were bound to a 96 well 
polyvinyl chloride microliter plate (Costar) overnight at 4**C. The background was blocked 
wift 1 % Bovme serum albumin (BSA) in O.OIM phosphate buffered salme (PBS) for 2h at 
37^C, washed 1hricewittiwashingbuffer,PBST (PBS and0.05%Tween20) and rabbit anti- 
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cholera seram dihxted 1 : 8,000 in PBST containing 03% BS A was added and incubated for 
2h at 37*C. The wells were washed and incubated wifli 1:50,000 mouse anti rabbit IgCr- 
alkaline phosphatase conjugate in PBST containing 0.5% BSA for 2h at ST'^C Hie plate 
was developed with Sigma Fast piNPP substrate (Sigma) for 30 minutes at room temperature 
5 and the reaction was ended by addition of 3N NaOH and plates were read at 405 mn. 

GMiGanglioside Binding Assay: Todeterminetheai5nityofcMon>plastderivedCrBfor 
GMrgangliosidea, microliter plates were coaled with monosialoganglioside-GMi (Sigma) 
(3.0 /ig/ml in bicaib. buffer) and incubated at 4Xovenu[gJiL As a control, BSA 
(3.0 Azg/ml in bicarb, buffer) was coated on some wells. The mtbUs were blocked wifli 1% 
10 BSA in PBS for 2h at 3TC, washed thrice with washing buffer, PBST and incubated with 
dilutions of transformed plant protein, untransformed plant protein and bacterial GIB in 
PBS. Incubation ofplates with priinary and secondary antibody'dihidons and 
similar to the CTB EliSA procedure described above. 

pLD-LH-CTB vector constraction and R coU expression: The pUVLH-GIB vector 
15 integrates the genes of interest into the inverted repeat regions of the chloroplast genome 
between the tml and tmA genes. Integration occurs tihtrough homologous recombmation 
events between the tml and tmA chloroplast border sequraces of &e vector and the 
coireqMmdmg homologous sequences of die ddoroplast genome as shown in Fig. lA. The 
chimaic aminoglycoside 3* ad^iyltiansferase (aadA) gene that confers resistance to 
20 spectinomycin-str^tomycin and tihe CIB gro^ downstream of it are drivm by the 
constitutive promoter of the rKNA qperon (E^m) and transcription is terminated by the 
psbA3* untranslated ri^(m. Since die protein syndiettcmachiiieiy of cUoroplasts is su 
to that ofE. coti (23X GTB e^qiression of the pLD-LH-CTB vector in E. coli was tested. 
Western blot analysis of sonicated JSl coli ^ole cell extract showed die presence of 1 1 kDa 
25 CIB monomers, similar to that obtained when purified commercially available CTB was 
treated m the same manner as shown in Fig. 2A Oligomeric expression of CIB was not 
observed in E. coli, as expected, due to the absence of a lead^ pq)tide sequence present in 
the native CTB gene diat directs the CTB monomer into the periplasmic space allowing for 
concentration and oligomeric assembly. 
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Sdeetion and R^raeration of Itansgenic Plants: Bombarded leafpieoes when placed 
on selection medium comtimied to grow but were bleached. Gre&x shoots m&cgcd from iho 
part of the leaf in contact with the mediunL . Five rounds of bombardment (5 leaves each) 
resulted in 68 indq)endent transformation events. Each such transgenic line was subjected 
to a second round of antibiotic selection. These putative transfonnants were subjected to 
PGR analysis to distinguish fitmi nuclear traiisformants a^ 
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Determination of Chloroplast Integration and Homoplaany: PGR and Soufhem 
b^ridization wm used to determine int^cation of die CTB gene into the chlotoplast 
Smome. Primers^ 3P and 3M» designed to confiim incoiporation of ttie gene cassette into 
the chloioplast genome were used to screai putative transgenics initially. The primer, 3P, 
landed on the chloroplast genome outside of the chloroplast flanking sequence used for 
homologous recombination as shown in Fig. 1 A. The primer, 3M, landed on the aadA gene. 
No FOR. product should be obtained if foreign genes are integrated into the imdear genome ' 
or m mutants lacking die aadA gene. The presence of the 1.6kb PGR product in 9 of the 10 
putative transg^cs screened, conlSrmedfhe site^speciGc int^^ation of the gene cassette into 
the diloroplast genome. Database searches ^owed that no random priming took place as 
both the 3P and 3M primers showed no homology with other gene sequmces. This is 
confirmed by the absence of PGR. product in untransformed plants (Fig. IB). Similar 
strategy has bem used successfully by us in order to confirm chloroplast integration of 
. foreign genes (1344>24,2S). This screening is essential to ^ittwipatft mutants and nuclear 
transformants and saves space and labor of maintaining hundreds of transgenic lines. 

Southern blot analysis of three of the PGR positive transgenic lines was done to 
finther confirm site specific integration and to establish copy niunber. In the chloroplast 
genome^ BgQI sites flank the chloroplast border sequrac^ of 1 6S rKNA and 3 ' of the tmA 
region as shown in Fig. 3A. A 6.17kb firagment firom a transformed plant and a 4.47 kb 
fingment &om an untransformed plant were obtained when total plant DNA firom 
transformed and untransformed plants was digested with Bgl n. The blot of the digested 
products was probed with a ^ random primer-labeled 0.81 kb tml-tmA fiagment The 
probe hybridized with die control giving a 4.47 kb fi»gm^ as expected, while for die 
transgenic lines a 6.17kb fi»gment was observed, indicating that all plastid genomes had the 
gene cassette inserted between die tml and tmA regions. The absence of a 4.47 khfiragm^t 
in transgenic lines indicates that homoplasmy has been achieved, to the detection level of 
a Southern blot These results e?q>ldin the high levels of CTB obsCTved in transgenic tobacco 
plants. Soutiiem blot confirmed plants transferred to pots were sem to have no adv^e 
pleiotropic e:ffects when compared to imtransfoimcd plants as shown in Fig.4A. Southern 
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blot analysis of T, plants in Fig. 3C shows fliat all 4 liansgemc lines analyzed maintained 
homoplasniy. 

InimimoblotAnalysisof CMoroplastSynthesi^ Anti-cholera toxiBantibodicsdid 
not show significant ccoss-ieaction with tobacco plant protein as can be seen in Fig. 2 C> 
lanes 1 & 2. Boiled and imboiled leaf homogenates were tun on 12% SDS PAGE gels. 
Unboiled cUoraplast synthesized CTB protein appeared as conq)act 45 kl>a oligom^ as 
shown in Fig. 2C» lane 4 similar to the unboiled, pentameric bacterial CTB which sppearcd 
to have partially dissociated into tetramers, tritn^ and monomers upon storage at 4''C oy&c 
a period of several months fiom Fig. 2C, lane?. 
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While heat treatmeait (4 mm. boilingl prior to SDS PAGE of peatameric bacterial 
Cl'B, gave CTB monomefs predomiiiantly, with some protem in the dimeric and trimeric 
fbnn as shown in Fig. 2C, lane 6, chloroplast synthesized CTB dissociated into dimers and 
trimers only, when subjected to similar heat treatment as in Fig. 2C, lanes 3 & S. These 
results are different from the heat induced dissociation of potato plant nucleus synthesized 
CTB; oligomers into monomers (8). A probable reason for this stability could be a more 
stable conformation of chloroplast syn&esized CTB which maybe an added advantage in 
stor^ and administratian of edible vacdnes. Leaf homogenates from four different 
transgenic plants showed afanost similar ejcpression levds of CTB protdn (see Fig. 2B). 
This suggests very litfle clonal variation of CTB e3q)ression, as was con£amed later by 
EUSA quantification assays. Consistmit expression levels ofrccombinantproteiiis in plants 
(as obtained for CTB in this research) maybe essential for production of edible vaccines in 
plants. 

EUSA Quantification of CTB Egression: Comparison of the absorbance at 405nm of a 
known amount of bacterial CTB - antibody complex (linear standard curve) and that of a 
known conceattation of transformed plant total soluble protein was used to estimate CTB 
e?q)ression levels. Optimal dilutions of total soluble protein from two transgenic lines were 
loaded in wells of the microliter plate. As reported previously (8), it was necessary to 
optimize &e dilutions of total sohjble protdn, as levels of CTB protem d^ected varied witti 
tiie concentration of total soluble protefai, resulting in too high concentrations of total soluble 
protein inhibiting the CTB prot^ from bindmg to die wells of the plate. Both Tq lines 
yielded CTB protem levels ranging between 3.5% to 4.1 % of flie total sohible protem (40 
/ig of chloroplast synthesized CTB protein in 1 mg of total soluble protem) as shown in Fig. 
SA. Also, estimation of CTB protein expression levels from differ^ stages of leaves - 
young, mature and old determined that mature leaves have the highest levels of CTB protein 
e^^ression. This is in accordance with the results obtained when siniilar experiments were 
performed when the Bt Cry2aA2 gene was expressed wilhouttfae putative chq)eronin genes^ 
but contrary to results with the Bt Cry2aA2 operon, which showed hi^ expression levels 
in older leaves, probably due to the stable crystalline structure (13). 
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GM]Gfln|^oside£LISABindiiig Assays: BotfacUon3pIasts}nalhe$i2edaQdba(rte^ 
demonstrated a stnmg a£Bmty for GMl, * gangliosides (see Fig. SB) radicating that 
chloh>plast synflicsized CIB cons^ed tlie antig^ 

pentamertodxepratasacdiarideGMiL TheGM, binding abiUty also suggests {Mroperfolding 
of CiB molecules resulting in the pentameric structure. Since oxidation of cysteine residues 
in the B subunits is a preroquisite for in vivo formation of CTB peatam^ (20), prop^ 
folding is afiirdier confirmatian of the ability of chloroplasts to form disulfide bonds. 

Hig^ levels of expression of CTB in transgenic tobacco did not affect growdi rates^ 
flowering or seed settmg as has been observed in diis laboratory, unlike previously r^rted 
for the synthetic LTB gene, constitutivdy e^qiressed via tiie nuclear genrane (7). 
Transformed plant seedlings were green in color while untransformed seedlings laddnglfae . 
aadA gene were bleached white as shown in Fig. 4B when germinated on antibiotic medium. 

The potential iiseofthis technology is diree-fold While, it can be used for larg^scale 
production of purified CIB , it can also be used as an edible vaccine if expressed in an edible 
plant or as a transmucosai carrier of peptides to which it is fused to, so as to eillier enhance 
mucosal inimunit/ or to induce oral tolerance to the products of these pq)tides (S). Larg&- 
scale production of purified CTB in bactraia involves the use of expensive fmnentation 
tedmiqnes and stringent purification protocols Q€) makmg fliis a prohibitive^ expensive 
tedmology for developing countries. The cost ofproducing 1kg of recornbinant protein 
tran^enic crops has been estimated to be 50 times lower than the cost of pnxlucing the same 
amount by K coli fennentation, assmning that recombinant protein is 20% of total E,coli 
protem (27). I1nis» isolation and lysis of CTB producing chloroplasts from cfaloroplast 
tnuisfijonedplantB could serve as aoost--ejOrective means of m 

If used as an edible vacdn^ a selection schone eliminating the use of antibiotic resistant 
genes should be developed. One such scheme uses the beteine aldehyde dehydogoiase 
(BADH) gene, which converts toxic betaine aldehyde to nontoxic glycine betaine, an 
osmoprotectant(28). Also, several other 
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strat^es have been proposed to elintiiiate antibiotic-iesistaiit genes firom transgenic plants 
(29). 

Transgenic potato plants that synthesize a CTB-insulin fusion protein at levels of up 
to 0. 1% of Ae total soluble tiiber protein have been found to show a substantial reduction 

5 in pancreatic islet inflanunation and a delay in the progression of clinical diabetes (30). This 
may prove to be an effective chnical approach for prevmtion of spontaneous autoimmune 
diabetes. Since, increased CTBe^^ression levels have been shown to be achievable via ttie 
chbroplast g«iome through this research, expression of a CTB-proinsulin fusion protein in 
the cUoroplastsofedible tobacco (LAMD) is currently being tested in our labo^ While 

10 existing expression levels of CTB via the chloroplast genome ate adequate for commercial 
exploitation, levels can be increased further (about 10 fold) by insertion of a putative 
chaperonin, as in the case of the Bt Cry2aA2 operon, (13) which likefy aids in the 
subsequent purification of recombinant CTB due to crystallization. 
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> Expres&j» 




Giiy*s 13 monoclonal antibody > 



»kDH 




Fignre 6: 12% reducing PAGE. CbfanOiiiiiiittsceirt 
detection witi) rabbit aixti-choIciE scnon and AP 
lab^ed mouse anti-fabbit IgG 0) antibodiBS. 
Untnmsfoimed, boiledTI) and unboiled (2); Tiaua fi pun edy 
boiled (3&5) and unboiled (4);PUrified CIB boiled (6)and 
unboiled C^; Marker (8). 

• HSA Nadear transformation of potato plants , 
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Figure?: A, B) reducing gels, lanariccrs, 2:Transgenic 
extract showing expression of light (A) and heavy chain 
(B) in cfalorpplasts, 3: Untransforme^ 4: Hninmi IgA. Q 
non-redncing gel l.Transgenic extract shcFwing assembly, 
2: Untiansfiirraed, 3: Human IgA. Biota A & C were 
detected with AP caiuugated ^oat aoti-hnman kappa 
antibody. Blot B was detected with AP oonjugpted gpat 
anti-lnuDan IgA antibody. 




ReLilive 
Ireqaency * 

(^) 40.00 



figure 8: Western Blot of transgenic potato tubers, 
D&rec 30 )ig of tuber prot^ was loaded per ]ane - 
probed with anti-HSA antibody. I : wild Npe; 2: 40 " 
pure HSA; 3*8:difBrent trangenic uoes, 
di feren t levels of expression. 



• Expression of HSA by chloroplast 
vectors hkK colL - 
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0 0.02 OM ao« o.ot OJ OJl Q.U O.U 0.18 0^ 
% HSAr is tbe total solobie protein* 
Figure 9: Frecucncy histogram indudiqg percentage 
Kennebec and D£sir6e transzenic plants expressbg 
different HSA levels. Kesiws are sfaown as Hie 
percentages of transgenic plants (vexttcai ajds) tliat 
express a specific level of HSA of the total sohible 
IHOtein 0iQn2ontal axis). 

- Codon composition and expression levels> 



Flgura 10: Western Blot of £. caH protein 
extracts. 1: 50 ng pore HSA; 2: molecular w^£^ 
marker, 3: pLD-ffiA (control witfiout RBS); 4: 
PLD- 5*UTR-HSA; 5: ptD-RBS-HSA; 6: pLD- 
0RF1+2-HSA; 7: E. cdi without pLD vector. 
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Table 1: Unmodified native codon compositkm and exprcsslcm 
levels observed in transgenic chloroplasts. See section d) for 
details of AT content, %p^A optimal codons and % of codons 
Oat Toatcb the q> tRNA pool. TSP: % total sohible protein 
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in traiir^ ffftn^g gM oroplaste 
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Flgmpe 1: Ci>2A protdn ooooeatiBtiQa d rt ram i a ed by EUSA ia 
HansgGotc leaves. >tote 100-fold increase in pnxtmn ac ia i i inilat i nn m 
the pmenoe of tt» putative ckaperoDiD, ORFZ 



Figure 2 -JnmuDOgoid hbded electron microscopy 
of mature transgenic {eaf. Oy2Aa2 crvstals ia a 
transgenic chloro^last ocpiessing the czy2A 
opefVHL 



► EroTcsaion of a small C22bsl) peptide in transgenic chloroplasts, 

KoassayP. aeruginosa 




Transgenic Uatraosfbrmed 

Figure 3 . Leaves were infected with 10 pi of 8x10^, 
8xl0*,«xl03 and ftclO^ceHs of/*, syringae. Photos 
WCTB taken 5 days after inoculstion. 1-2 )xg of 
antiinicrobial peptide CAMP) is required to kill 1000 
bacterial cells. Local concentration st the site of 
miection is estimated to be 200-800pg AMP. 




Transgenic WIdtypo Buffer only 

Figure 4. Total plant protein was mixed wifli*5|i) of mid- 
log phase bacteria froni ovemigbt culture incubated for 2 
hoars at25°C at 125rpm and grown in LB broth ovemig^ 
Based on mmiminn inhibitory concentration of 1-2 |ig 
AMP/1000 bacterial cells, the expression level was 
calculated to be 2 1 S-AWq of the total soluble protem. 



> RTpwi5Min n of OKgomeric form fcKrolfide bonded^ CTB in transgenic chloroplasts. 
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transgenic line 3 transgenic Hne 7 



Trans.l ne3 
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Figure 5: A) CTB SUSA quatification is shown as a percentage of die total soluble plant protein. Total soluble 
plant protehi from young, mature and old leaves of transgenic lines 3 and 7 was qoannfied. B) CTB-GMl 
GangUoside bniding BLISA assays: Plates coated first with GMl gangliosidcs and BSA were plated widi total 
sohible plant protein from lines 3 and 7» nntransfa r m ed plant total scMuble protein and purified bacterial CTB. The 
ateorbanceortheGMl gangiioside-CTB antibody canqilex was measured. 
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PRODUCTION OF BnOMAN SERUM AT ,mTMTN 
IN TRANSGENIC TOBACCO 

5 FIELD OF THE INVENTION 

This invention relates to production of higji value phannaceutical proteins in nuclear 
transgBDic plants, particularly to productioa of fauman semm alhumm in tzansgesxic tobacco. 

BACKGROUND 

1 0 Human senmi albumin (jEIS A) is a monomenc globular protein consisting of a single^ gpnemlly 

nongtyoos^ated» polypeptide dbain of 585 amino adds (66.5 KDa and 17 disulfide bonds) with no 
postranslationalmodi&ations. ftiscon^Kisedofdireestructurallysimilarg^obulardomainsan^ 
disulfides are positioned in repeated scries of nine Ipop-link-Ioop structures ccatmd around ei^t 
sequential Cys-Cys pairs, HSA is initially syndiesized as pre-pro-an)unim by die liver and released 

1 S fiomlbe endoplasmatic r^culum aftertemoval of the aminotenninal prq>^tide of 18 amino adds. 
Thepro-albuxninisfiirfiierprocessedinfheGolgicoi^ 

of Aepropq;>tid6 are cleaved by a serine proteinase (1). This results in tbe secretion of Ifae mature 
polypeptide of 585 amino acids. HSA is encocted by two codominant autosomic allelic genes. HS A 
belongs to die multigene^unilyofproteins that inchidealpha-fetoprotein and human group-specific 
20 conqponent(Gc) or vitaminD-binding family. HSAfacilitatestransferofmanylig3ndsacax)SSorgan 
drculatoiyintex&ces such as in the Uv^» intestine^ Iddney and brain. In addition to blood plasma, 
senim albumin is also found in tissues. HSA accounts for about 60% of the total protein in blood* 
senna The concentration of albumin is 40 mg/ml in the seamn of human adults. 

25 Medical applications offfiSA: Theprtmaiy foiicdonofHSAisdieniamtaianceof coUo^ 

pressure (COP) widnn the bloodvessels. Its abundance makes it an important detenninant of die 
phaxmacoldnetic behavior of many drugs. Reduced synthesis of HSA can be due to advanced Hver 
dise^e^inqiairediritestinal absorption ofnutrients or poor nutritional intake. Increased albumin 
losses can be due to kidney diseases (increased glommilar penneability to macromolecules in die 

30 nq>hrottc syndrome^ intestinal diseases QyrotBinrlosing enteropathies) or exudative aktn disorders 
(bums). CataboUc states such as chronic infixtions» sq>sis, surgety, intestinal resection, trauma or 
extensive buniscanalsocausel^poalbunmLeniia. HSAisusedindiBE^ofbloodvoIumedisozdecs> 
for example posthacmorrfaagic acute faypovolacmia or extensive bums, treatment of ddiydration 
states, and also for cinhotic and hepatic illnesses. Itis also used asan additive in perfusion liquidfibr 

35 extraoorporealdiculation. HSAisusedclinicallyforreplacingbloodvolume,butaIsohasavariety 
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of noi>-fliarq)eiitic iises^ mchidmg its role as a 

HSAis a stabilizer for biological "?fttwriflk iniotare andis i]sedfbrpiq>aringbiological standards and 
ref^^nce materials. Furtfaormore, HSA is fieqaaitly used as an experimental antigen^ a cell-culture 
constitDeni and a standanl in climcal-cheoiistry tests. 

5 

Expression systems for HSA: Ilie expression and pioiQcation of recombinant HSA frc^ 
niicrpocgamsms has been rqiorted previously (2-6). Saccharon^ftxs cereinsiaeh^JxmmoA^ 
prodoce HSA bolb intraodtalary, requiring denatnration and refolding prior to analysis (7X and by 
secretion (S). Secreted BSA was equivaleDt stnicturally» bat Ibe recombinant prodnc^ had lower 
10 levels ofexpressicm (recoveiy) and stnictural heterogeneity oon^iared to die blood derived protein 
(9). HSA was also esqxressed in Kliiyveromyces lactisy a yeast with good secretary properties 
achieving 1 ^ter in fed batdi cultures (10). Ohtani et al (11) developed aHSA expression system 
usingiYcftia/w^/£y and established a pu^ 

levekofpurity and properties as die human protdn. JnBadlhissubUliSyBShcoQiAh^BGQXi^ 
IS using bacterial signal pqytides (4). HSAproductionin^ cafiwassacoessfolbutreqiiiTedadditioiial 
in Wl>ti processing with trypsin to yield the mature SijmonsetaL (12) expressed HSA 

in transgenic potato and tobacco plants. Fusion of HSA to die plant PR-SpresequenceiesoMed in 
cleavage of die preseqnence at its natural site and secretion of correctly processed HSA, that was 
indistinguishable firom the authentic human protein. The e?q>ression was 0.0 1 4% of the total soluble 
20 protein. However, iioiieofdiesernelhods have been e^loited commercially. 

Cliallenges in conunerdal production of HSA: Albunodn is currently obtained by protdn 
ftactionatjanfiom plasma and isttiew<»id!s mostused intravenous proteni» esthnated at around 500 
metric tons per year. Attnunin is typically administered by intravenous ixyection of solutions 

25 containing 20% of albumin, lire avenig^ dosage of albumin for each paddit varies between 20-40 
grams/day. The consultation ofafoumin is around 700 Idlogranis per iiiillionh^ In 
additim to hig^ cost, HSA has die risk of transmitting diseases as widi otiier blood-derivative 
products. The price of albumin is about $3.7/g. llms, the niaik^ of this protein ^^[jroximateiy 
amounts to 0.7 billion dollars per year in USA Because of die high cost of albumin, synthetic 

30 macromolecules (like dextrans) are used to increase plasma coUoidosmotic pressure. 

Commercial HSA is mainly prepared from human plasma. This source hardly meets the 
requirements of the wwldmadcet The availabiUtyofhumanplasnia is Unuted and careful 
treatment of the pioductprqiaiodinust be performed to avoidpotenti^ cootamination of tfaeproduct 
by hepatitis, HIV and odur viruses. The costs of HSA extraction from blood are very hi^ 

35 Iimovatiy 6 production sysfcrans are needed to meet the dmiands of the large albumin mado^ wifli a 
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safe product at a low cost Plant biotecfanology o£fers tttc promise of obtaining safe and cheap 
proteins to be used to treat human diseases. 

Chloroplast genetic engineering: When we developed the conc^ of chloroplast genetic 
5 engineering (13,14), it was possible to introduce isolated intact chloroplasts into protoplasts and 
r^enerate transgenic plants (15). Ihearefore, eaily investigations on chloroplast transformation 
focused en tbedevelopment of m organello systems usingintact ddarpplasts cs^mbleof efRcientand * 
prolonged tianso^itioEn and tianslatian (16 - 18) and esqiression of foreign genes in isolated 
chIoropIasts(I9). However^afiertfiedisooveryofthe gene gun as a transformation dm 

10 was possible to transform plant ddoroplasts without the use of isolated plastids and protoplasts* 
C!hloroplast^eticengiiieetingwasaccQnQ>lishedinseveralph Transiente^qiressionofforeign 
genes in plastids ofdicots (21,22) was followed by such studies in monocots (23). Urnqoetodie 
diloioplast gmetic engineering is the development of a foreign gene expression Systran using 
autonomously replicating chloroplast esqnression vectors (21). Stable int^ration of a selectable 

IS madcer gene into the tobacco (^oroplast genome (24) was also a^ 

However, nsefid genes confening vabable traits via dilorqplast genetic engineenng have been 
demonstrated cmly recently. Forexan9le»plantsresistanttoBXsensitiveiQsectswei:eobtainedby 
integratmg the crylAc gene into the tobacco chloroplast genome (25). Plants resistant to B.t 
resistant insects (np to 40,000 fold) were obtained by hyper-expression of &e cry2A gene within the 

20 tobacco chloroplast genome (26). Plants have also been genetically engineered via the chloroplast . 
genome to confer heibicide resistance and the introduced foreign g^ies were maternally inherited, . 
overcomingtheproblemofout-etosswidi weeds (27). Qiloroplastgen^engmeenngtedmology 
is currently bdng s^lied to odierusefid oops (14^). 

25 biyestlgatlons In progress: A lemaikable feature of chloroplast genetic ^igineering is the 
observation of exceptionally large accnmnlation of foreign proteins in transgenic plants, as nmdi as 
46% of CRY protein in total soluble protein, even in bleached old leaves (29, see attached rqwrt De 
Cosa et al. 2001). Stable expression of a pharmaceutical proteiii in chloroplasts was first r^rted 
for 0V6VP, a protein based polymer with varied medical plications (such as Ihe p re v a iti on of 

30 post-surgical adhesions and scars^ wonmd coverings^ artificial p^cazdia, tissue reconstructioh and 
progtammed drug delivexy (30)). Subsequently^ expression of foe human somatotropin via the 
tobacco chloroplast genome (31) to high levels (7% of total soluble protein) was observed. The 
following investigations that arc in progress in our laboratory illustrate the power of this techno! ogy 
to express small peptides, entire opcrons^ vaccmes that require oHgomcric proteins with stable 

35 disulfide bridges and monoclonals that require assembly of heavy/ii^t diains via chq^^onins. 
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Engineeri]^ novd pathways via flie chloroplast gmome: la plant and animal cdls, nuclear 
mRNAs are translated moncKnstronicany. This poses a soious problem ^^vhen^igineei^ 
genes in plants (32). Therefore, to express the polyhydroxybutyrate polymer or Guy*s 13 antibody, 
single gqnes were &stiDtroduced into individual transgenic pl^^ Hien, these plants were bade- 
5 cix)ssedtoreconstitutedieentirepathwayorthecojnpleteprotein(33^4)^ 

long effint. Ye et al. (22) recently introduced a set of three g^es for a short biosyuthetic pathway 
tfaatiesultedinP-caiotBneexiiressiQniniice. bicontzast,niostdilcxqplast genes ofhi^ier plants are 
cotranscribed ^2). Expression of polycistrons via the dhdoroidast genome provides a nrnque 
opportunity to express entire pafl(waysia a single ti^ We have receody used the 

1 0 Bacillus tkurittg^ensis (B t) cry2Aa2 opaxm as a model system to demcHistrate operon expression and 
aystal&imationviadke chloroplast genome (29). C/y2Aa2isthedi5talgeneofadiree>-ge3)e<q)aon. 
The OT/unmediatelyi^streamof cr)/lPi2l codes for aputative chaperonin that fedlitates die folding 
of co'2Aa2 (and other proteins) to form proteolytically stable cuboidal crystals (35). 

Thetcforei» the cry2Aa2 bacterial openm was expressed in tobacco diloroplasts to test die 

15 resultant tcansg^iic plants for increased expression and inqgoved persistence of the accnmnlated 
insecticidal proteai(s). Stable foreign gene int^cadon was confhnied by PGR and Soulbem blot 
analysis in To and T| transgenic plants. C>y2ifa2 operon derived pn>tdn accumulated at 453% of 
die total soluble protein inn[iBture leaves andremained stable even m oldble^ 
figure number 4 in attadied article De Cosa et aL 2001 , 29). This is the highest level of foreign gene 

20 expression ever rq)orted in transgenic plants. Exceedingly difficult to control insects (1 0-day old 
cottonbollwonn,be^armyworm) were kiUed 100% aft^coiisutning trails Electron 
microgr^hs showed die presence of die insecticidal protein folded into cuboidal crystals similar in 

De Cosa et aL 2001, 29). 

25 ' In contrast to cuircndy marketed transgenic plants with soluble CRY proteinsi» folded 

protoxincrystalsareprocessedoiifybytargetiiisectstfaathavealkalineg^^ This approadi should 
improve safi^ of Bt transgenic plants. Absmce of insecticidal proteins in transgenic pollen 
eliininates trixidty to iion-tarptiiisects via pollen. In addition to diese envinnnxientally firiendty 
approves, this observation should serve as amodel system for large-scale production of foreign 

30 proteins widiin chlcHOplastsinafolded configuration enhaiicingthe^ 

stcppurificatioiL This is die first demonstration of expression of a bacterial operon in traiisgqu 
plants and opens die door to engineer novel padiways in plants in a single transformation event 

Ezpresslngsmallp^tidesviatliechloroplastgenome: Sis commonknowledgediat die medical 
3 5 community has been fighting a vigorous batde a^inst drag resistant padiog^ bacteria for years. 
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Cationic antibacterial peptides fiommanmaHaiifihibiaiis andin^^ gpinedmoie atf e n t iop 
overthelastdecade(36). Key features of tfaesecationicpqytidesaiean^positiw 
fbr negatively-chatged prokaiyotic membrane phospholipids over neutral-charged eukaryc^c 
membranes and ttie ability to form aggregates ttiat disn^t the bacterial mexobxane (3 7). 
5 Therearethreenmjorpqrtideswitba-heUcalstructureSyCecropmfitm 

(giant silk mo&X TT^ag^inmg AamXenopus laevis (AGrican frog) and defemins from mammalian 
neutrophils. Magainin and its analogues have been stodied as a faroad-spectrmntopicsda^ 
systemic antibiotic^ a WDnnd4icaling stmmhmt; and an anticanocr agent (38). We have reoenfiy 
observed that a synthetic lytic pqytide (NfS^^ 

10 tobacco chloioplast 09). ThepqitideietaineditslyticactivityagainsttiiepbytD^ 

Pseudomcnassyringae?sx^msii1^^ The 
anti-microbialpqitide (AMP) used in this study was an amphipathic alpha-helixmolecule that has an 
affinity for negatively charged pho^hol^)ids conunonly found in the outer-membrane of bacteria. 
Upon contact with these membranes, individual peptides aggregate to form pores in the 

IS membzane, resulting in bacteriallysis. Because oftfae concentration dq>eadent action of the AMP, 
itwase3q[iressedviathecfalQroplastg^ooieto acconqplishhiglidosedelivefyattfaepointof infection. 
PGR products and Soutfaem bl<^ confirmed cblorqplast integiation of the foreign genes and 
^homoplasmy. Grow^anddevelopmrntoftfaetransgenicplantswasunaffectedbyhyper^expressio^ 
ofthe AMP within ddoroplasts. i^i vz/ro assays wi^T^ and T, plants confiimed that the AMP was 

20 expressed at higih levels (21.5 to 43% of the total soluble protein) and retained biological activity 
z^ffas^sXPseudomonassyrmgae,2kXDBioxp\B^ //i assays resulted in intense areas of 

necrosis around the point of infection in control leaves, "vt^e transformed leaves showed no signs 
ofnecn3sis(200-800)igofAMPattiiesiteofinfectioii)asshowninFig.l. T|/>tv20Y> assays agair^ 
P&eudomonas aerugjmosa (a multi-drug resistant human pathogen) diqilayed a 96% inhibitian of 

25 growthassfaowninFig.2. IheseresnltsgiveanewoptionintfaebattleagainstphytopathDgenicand 
drng-resistant human pathog^c bacteria. Small peptides (Uke insulin) are degraded in most 
organisms. However, stabili^oftiiis AMP in (Moroplasts opens up this coiiyartuicut f M 
of hormones and other small peptides. 

30 Expression of cholm toxin p sabnmt oligomers as a vaceine in diloroplasts: Vibrio ckoleraey 
which causes acute watery diardiea by colonizing the small intestine and producing the enteiotoxin, 
cholm toxin (CT). Cholera toxin is a bexameric AB5 protein consisting of one toxic 27kDa A 
subunit having ADP ribos^ transferase activity and a nontoxic pentamer of 1 1 . 6 kDa B subunits 
(CHTB) that binds to the A subunit and ficiUtates its entry into the i^ CTB 

35 whenadministmdoraUy(40)isapotentnmoosaliD]murK)genwhichc^ 
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tfaeCTholotoxinbypreventixigitfiDmbii^^ This is believed to be a 

result of it binding to eukaxyotic cell surfaces via the g3ngiiostdes» receptors preset on the 
intestinal epitiielial siir&ce, titus eliciting a mucosal innmine response to pathogens (42) and 
enhancing the immune response when chemically coi^led to other antigens (43 - 46). 
5 Qiolcra toxin (CTB) has previously been expressed in nuclear transgenic plants at levels of 

0.01 (leaves) to 03% (tubers) of the total soluble protein. To increase expression levels^ we 
engmemd fte chhxroplast genome to esqxress the CTB gene (47). We cbsismd expression of 
oUgp8nericCnBatlevdsof4-5%of total sohible plant protem as sto PCStand 
SouQiaDQ Blot analyses confinncd stable int^iatiQ& of llie CTB gene into llie cfaloroplast geaome. 

1 0 Western blot ai^lysis showed that transgenic diloroplast e3q)ressed CTB was antigenically identical 
to commercially available purified Cm antigen as shown in Fig. 4. Also^GMrganglioside binding 
assays confirm that dhlcroplast synfliBslzed CTB biiids to the intesti^ 
toxin as ^owninFig.3B. Transgenic tobacco plants wore morphologicaUyindistiiigui^^ 
untiansfimned plants and the introduced gene was found to be stably inherited in the subsequrait 

15 ^eraticm as confirmed by PCR and Southern Blot analyses. The increased production of an 
efficioittraDsinucosalcarrierrnoleciileaiid ddivefy system, like GIB, in chlorc^lasts of plants makes 
plant based oral vaccines and fusion pioteiris with CIB needing 

feasible ai^iroach. This also establishes unequivocally that chlon^lasts are c^)able of forming 
disulfide bridges to assemble fi>rdgn protdns. 

20 

Expression and assembly of monodonals in transgenic chloroplasts: Dental caries (cavities) is 
probably the most prevalent disease of hammnlrind. Colcmizationofteelb by iKimi^ is the single 
most in^Ksrtant ride fictor in the devdqpment of dental ca^ £ iitulianffisanoiirinotile^gram 

positive coccus. Tt nnlanimg tnnth gnrfeflfta smA fsyniheisami g^iM^anc (infinlnMft p^lygaftrWiftp) gfir| 

25 fiuctans jQxmi sucrose using the enzymes g^ucosyltransferase and fiuctosyltransfeiase respectively 
(48). llie ghicaiis play an in^rtant role by aUowiiig the bacteriimi to adhere to the 
sur&ces. After its adherence, the bacterium ferments sucrose and produces lactic acid. Lactic ^d 
dissolves the minerals of the toodi, jmMlucing a cavi^. 

A topical monocloi^ antibody ther^y to prev^t adherence of 51 mutans to teeth has 

30 recently been developed. The iriddenceofcaiiogauc bacteria (in bumans arid animals) and dental 
caries (in animals) was dramatical^ reduced for periods of iqp to two years after the cessation of tfie 
antibod/ thers^y. No adverse events were detected either in the exposed animals or in human 
volunteers (49). TheaiuiualrequirementforthisantibodyintheUSaloneimryev 
metric toa Ihcrefbre^ this antibody was e7q)ressed via the chloioplast genome to achieve higher 

35 Ievelsofe79res8ionaiulproperfolding(50). Iheint^grationof antibodygenesintotfaedilorc^last 
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gpiomewasconfirmedbyPC^andSoxi&emblotanalysis. The expiesdcmofboA heavy and fight 
chains was confirmed by western blot analysis under reducing conditions as shown in Figs. 5A and 
B. The expression of Mlyasseinbledantibocty was confimed by we^ 

reducing conditions as shown in Fig. 5C Thisisdiefirstrq)cutofsuccessfidassaaDiblyofamulti- 
S subunithmnanpioteiniDt[ansgeiiicchlorq>lasta Ptoductionofznonoclonal antibodies at agricultural 
level should reduce their cost and create new plications of monoclonal antibodies. 

ft 

SIgnHlcaitcc: Medical molecular pharming in tetnsggmc plants has been reviewed recently (51). 
Since tfe demand &r dbsap and safe sources of HSA is expected to increase considefably in line 

1 0 CQmingyears» it wouldbe wise to ensure tfiat in the futurefiiispniteinwLllbe available in significantly 
largCT amounts, preferably on a cost-effective basis. Becausemost genes canbeesqnressed in many 
difierent syst^tns^ it is essential to determine which system offers the most advantages for the 
manufacture of the recombinant protdxu The ideal expression system is one tiiat produces a 
maximum amount of safe, biologically active matmal at aminimum cost The use of modified 

IS mammaHanodbwiflircoombinantDNAteclmiqueshasflieadvant^ofresultihgm 

arecloselyrelatedtothoseofnatuxalQdgtn. However, cnlturing&ese cells is intricate and cm 
be canied out on Umited scale. 

The use of microorganisms such as bacteria permits manu&cture on a larger scale, but 
introduce &e disadvantage of producing products, which differ ai^reciably fiom the products of 

20 natural origin. Forexample,proteiiisthatareusuallyglycosylatedinhuraaiisarenotgly^ 
bacteria. Furdi»3nore,hunianpn>teinsthatare^iessedatU^leve]sinjElco/ifi«^ 
an unnatural conformation, acconqraniedby intracellular prec^tation due to lack of prqp^ folding 
and disulfide bridges. Production ofreoombinantprotemsmplantshasnianypQ^ 
forgqMiatingbic^AarnMceuticalsrdevanttDcBTrical medirine. Tbeseindudediefolkwing; (i)p]ant 

25 systems are more economical tlmn industrial facilities usingfermentatian systems; (ii) tecfanologyis 
available for harvesting and|Mrocessingplaxrts/plantproducts onalargescale; ^ii) diminution of the 
purification requirement when the plant tissue containing the recombinant protein is used as a food 
(edible vaccines); Qv) plants can be directed to target proteins into stable, intracellular compartments 
as chioroplasts, or ^qnressed directly in chloroplasts; (v) the anK)unt of recombinant productthat can 

30 be produced a|yproaches mdustrial-scale levels and (vi) healdi risks due to contaminatiQn with 
potential human pattiogensAoxins are minimized. 

It has been estunated that one tobacco plant should be able to produce more recombinant 
protein than a SOO-liter fermCTter of £1 colL In addition, a tobacco plant produces amillion seeds, . 
facilitating larg^scale production. Tobacco is also an ideal choice because of its relative ease of 

3S genetic manipulation and an impending need to e3q>lore altmiate uses for diis hazardous crop. 
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However, witfatite m^tioiiof en:^iiies(e.g. 

transgemc plants are generally low, mostly less dian 1% of the total soluble protein (S2). MayetaL 
(53)discussthispn)blemiismgthefoUowmgexanq)les. Alttoiig^plantdedvedreconibinanthqxi^ 
B sur&ce antigisn was as effective as a commercial recombinant yacdne; the levels of expression in 
5 transgenic tobacco were low (0.0066% of total soluble protein). Even though Norwalk virus capsid 
protein expressed in potatoes caused oral immunization when consumed as food (edible vaccine), 
expression levds weare low (03% of tolal soluble protdn). Inparticular, esqnessian of huinan 
protdns in nadear transgenic plants has been disappointh^rly low: 

0.000017% of fiesb wcig]lit,faaman senmi albuintn 0.02% andeiythropoietin 0.0026% of total soluble 

10 protein (see Tablel in re£ 52). A syn&etic gene coding &r flie human q[>idermal growfii fector was 
exiMressed only 1^ to 0.001% of total soluble protein in traiisgcni 

The cost of producing recornbmant proteins in al&l& leaves was estimated to be 12-fold 
lower than in potato tubers and corcqparable with seeds (52). However^ tobacco leaves are much 
larg|srandhav6niudihig}ierbiomassttanal&I& The cost ofproduction of recombinant proteins 

15 wiUbe50-foldlowerflianthatof£co/{&nneatation(\^ 52). Adecrease 

in insulin expression fix>m 20% to 5% ofbionuiss doubled the cost of prodo^ Esqnnesstim 
level less than 1 % of total sohible protein in plants has been found to be not coimnercially feasible 
(52). Therefore, it is important to increase levels of eTqpresEtion of recombinant pxiteins in plants to 
e:q>loit plant production of pharmacologically important proteins. 

20 An alta:nate ^preach is to express foreign proteins in dilon^lasts ofhigher plants. We have 

recently integrated foreign genes(up to 10»000 copies p^ cell) into the tobacco chlon^last genome 
resulting in accomulatiQn of recombinant proteins up to 46% of tfie total cellular protem (29), 
CUcxoplasttransfonnationutil^estwoflankingsequenc^ 
iriseit foreign DNA into the qiacer region between the fim^ 
'25 &US targeting tixe foreign genes to a precise locatton. This eliminates the *))Ositi(mefiCbcf' and gene, 
silencing fiequently observed in imclear transg^c plants. Chloroplast ^netic CTgineering is an 
environmentally fiiendly approach, minimiTing concerns of out-oross of introduced traits via pollen 
to weeds or other crops. Most importantly, a significant advantage in the production of 
pharmaceutical proteins inchloroplasts is their ability to process eukaiyoticproteins, including folding 

30 andfttmiationof disulfidebrkiges (55). Ch^^eponinproteins areprescait indilorpplasts (56»57)diat 
fimctioninfoldiiigandassemblyofprokaxyotiG/eukaryoticpiotd^ Also, proteins are activatedby 
disulfide bond oxidoAreduction cycles using the cUorpplast &ioredoxin system (58) or chloroplast 
protein disulfide isomerase (59). Accumulationoffidlyassembled, disulfide bondedformofhuman 
somatotropin via diloroplast transformation (3 1) and oligomeric form of CTB (47) and assembly of 

35 heavy andli^diains ofhumanized Guy's 13 antibody in transgenic cUoroplasts (50) provide strmg 
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evidence for SQccessfiilpiocessmg of phannac^ Such folding aiKi 

assembly should eliminate &e need for highly eixpensive in vitro processing of phamiaceutical 
protdDs. For example, 60% of the total operating cost in the production of human insulin is 
associated with in vitro processing (foimatiQn of disufide bridges and cleavage of niethiQnineX34). 

5 

Taken together, low levels of expression ofhunuin proteins in nuclear transgenic plants^ and 
di£Gciil^ in ibidmg; assenibly^prDcesstng of human {Mxiteins in E. coli should make dikxroplasts an 
ideal confiartment for expression of recombmant proteins. Pjcoductton of fanman proteins in 
transgemcddoroplasts also dEamaticaUy lowers the pnidnctim Laigp-scale pn>duction of 
1 0 hmnan soum albumin m plants is a poweifol qyproadi to provide safe treatment to patients at an 
affiinlable cost and provide tobacco fermers alteniate uses for ibis haza^ Tberefore>it 
would be highly advantageous to provide for esqjiession of human serum albumin in transgenic 
tobacco chloroplasts to increase levels of expression and acconq>lish in vivo processing. 



15 SUMMARY OF THE INVENTION 

lids invention synfiiesizeshig^valuepharrnaoeuticalprotdmi^ 
diloroplast expression for pharmaoeuticd protein productioiL CUoroplasts are suitable for ttus 
purpose because of their ability to process eukaiyotic proteins, inchiding folding and formation of 
disulfidebridge8»tiierd>yeliimnatingtheneedforexpensivepo5t-put^ Tobacco 

20 is an ideal dioice for this purpose because of its large biomass, ease of scale-iq> (milHcHi seeds per 
plant) and g^etic manipulation. We use poly(GVGVP) as a fusion protein to enable hyper- 
expression of hmnan sennn alhumin and acconq^lish r^id one stq> purification of fusion pq>tides 
utilizing the inverse tenq>erature transition prc^erties of tt^ We also use human serum 

albumiri-CTB ftsion protein in chloroplasts of nicotine fiee edible tobacco (LAMD 605) for oral 

2S deKvaytoNODmice. 

BRIEF DESCRIPTrON OF DRAWINGS 
Fig. 1 shows a pair of photographs of leaves infected with 10 ^d of 8x10*, 8x10*, 8x10^ and 
8x1 Qi^ cells of P. syringae^ taken 5 days afier inoculatioiL 
30 Fig. 2 is a gr^h of absorbance of 600 im of total plant protein mixed widr S lA of mid-log 

phase bacteria from ovomg^cultare^ incubated^ hours at2S''C at 125 rpm and gcownm LB 
broth overnight. 

Fig. 3 A is a grs^h of CTB EUSA quantification shown as percentage of the total soluble 
plant protein. 

35 Fig. 3B is a graph of CIB-GMl ganghoside binding BUS A assay of plates coated first with 
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GMl gang^osides and BSAweie plated with totdsobiblepl^ 

Fig. 4 is a photograph of a 12% reducmg PAGE of e?q>ressioii of CTB oligomers. 

Figs. 5 A and B show photographs of ledDcing gels of expression and assembly of disulfide 

banded Guy's 13 monoclonal antibody. 
S Fig. 5C is a photograph of a non-reducing gel. 

Fig. 6 is a photogr^h of a Western Blot of expression of HSA viamiclear genome in potato. 

Fig. TisapairoffieqaEDcyhistogiamsindndii^pewCTt^ 
plants expressing difiEcrent HSA levds. 
10 Fig. 8 is aphotpgiaph of a JFestem Blot of expTcaskm of HSAby ddoii^latyectozs in£ . 

cob*. 

Fig. 9 is a photogt^h of a Western Blot of expression of HSA via chlorqplast genome in 
tobacco. 

Fig. 10AisamapofthepIJ)ddoroplasttrB]isfbimationveGto 
IS Fig. 10BisaphotDgrq>hofanAg^iosegBlcontainingPCRprDductsusingtotalp^^ 

as tooiplatB fiom traiisfiKmied plants. 

DETAILED DESCRIPTION 
Expression of HSA via nuclear genome in potato: Recexitly, our collaborators in Spain cloned the 

20 humanHSAcDNA fiomhumanlivercellsandfiisedthepatatinpromoter(whoseexpression is tuber 
specific (60)) along widi the lead^ sequence of PIN n (proteinase II inhibitor potato transit peptide 
lhat directs HSA to the i^xsplast (61)). Leaf discs of Desiree and Kennebec potato plants were 
transformed usmg 4gn>tecrmtm liiin£^^ A total of 98 transgenic Desiree dones and 30 
Kennei>ecdone8wecetestedbyPaR.andwestemblots. WesternblotsshowedtfiatttieiecomLbmant 

25 albumin (rHS A) had been propedy cleaved by the proteinase II inhibitor transit peptide in Hg. 6. 
Expression levels of both cuMvars were very difiTerent among all transg^c clones as expected as 
shown in Fig. 7, probably because of position effects and gene silencing (62,63). The population 
distribution was siniilarhibo&cultivars: tfaeimjorityoftransgenicclanesshowedexpressionlevels 
between 0.04 and 0.06% ofxflSA in the total soluble protein. The maximum recombinant HSA 

30 amount e7q;M:6Ssed was 02%. B^ween one and five T-DNAfaiserticxis per tetroploid genome were 
observed in tiiese dones. Plants with hi^ier protein expression were always dones with several 
copies ofthe HSA gene. Levels ofmKNAwereanalyzedby Northern blots. Therewasacondation 

b«twften transcript levels andrBComhin^tit albiimm a mirmil^ti nn \i\ trmsgjHlic t^1bcrs, TbeN-tCrminal 
sequence showed proper cleavage of the transit pq>tide and the anuno terminal sequence between 
35 recombinant and human HSA was identieal. hihibition of patatin expression using the antisense 
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tec^olog/ did not improve Ifaeainou^ Aveiageejqxressioii level ammg 29 mti sense 

transgenic plants was 0.032% of total soluble protein, with a mflgfimum expression of 0.1%. The 
mayimum HSA e^q^ressioD level observed was 5-10 times more than that reported by Sijmons et al. 
(12). However, hitter levels are needed to make plant derived HSA productian commennally 
S feasible. 

OdorbplastexpressioiiofEKA: WehaveaIsoinitiatedtramsfimnatiQDof1faetDbaccoc±Io^ 
genome fi>r hyperaqxressioa of HSA, wbidii is a new technology that has rqxnted the hi^iest 
expression levds in plants (29). Hie HSA codm compositioii is advantagooos for cixloroplast 
10 expression and no changes in the inicleotide sequence were needed pLD vector was used finraOfte 
constructs. WedesignedsevmlvectorstocpthnizeHSAe}q>iessian. AH these contain ATG as the 
first amino acid of die nisture protein. 

1 - RBS-ATG-HSA : Hie furst vector includes tibe gene that codes for die mature HSA phis an 
additiraal ATG as a translation initiation codon. We included fiie ATG in one of tfieprimas of tfie 

IS IH:3t,Simcleotidesdownstieamofdie<^onqdastprefenedR^ ThedDNA 
sequence of file iiiatare HSA was used as tenqdate. Hie PGR product was doned into PGR 2.1 
vector, exdsed as an EcoRtNotl fiagment and introduced into die pLD vector. 

2- 5TJTRp8bA->ATG>HSA The 200 hp tobacco chloroplast DNA ftagmmit containing the 5* psbA 
UTR (untranslated region) was anq)lified using PGR and tobacco DNA as ten^late. The fragment 

20 was cloned mto PGR 2.1 vector, excised EcoRI-NcoI fragment was insert at the Ncol site of the 
ATG-HSA and finally inserted into the pLD vector as an EcoRI-Notl fragment downstream of die 
16S iKNAi»Domoter to enhance ttauslatiQQ of tfaeprotem. 

3- BtORPl+2-ATG-HSA: ORFl andORI^ofdieBt Ciy2^a2operonweieainplifiedmaPCR 
usmgtiiecQnq>leteoperQnasateiiqiIateL The fi:l^g^leIlt was donediiito PGR 2.1 vector, exci^ 

25 an EcoRI-EcoRV fragment, mserted at EcoRV site with the ATG-HSA sequence andmtroducedinto 
thepLDvectoorasanEcoRtNoafiagment TheORFlax^ORF2weiefii5ediqxstreamof1heAT& 
HSA 

4> BtORFl+2-5TJTRpsbA -ATG-HStA' The SXJTRpsbA was introduced m the vector number 3 
upstream of the HSA in theEcoRV-NcoI site. 

30 Expression of ddoroplast vectors was first tested in Rccii befinDe dieir use in tobacco 

transfimnation because of Aie similarity of protem synflietic madunery (64). Diffirent levels of. 
expression were obtained in £cofi depending on die construct as shown m Fig. 8. UsmgdiepsbA 
S'UTRandliieORFl aiidORF2of1faeci72/4a2ppcn>n,we6btainedhighcr^ 
using only die RBS. We observed in previous ^qp^imcnts that HSA in E, colt is completely 

35 insoluble^probablydueto aninqxroperfi>ldingresultingfrom the absence of d lUs 
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istfaeieasraiK^v^tliepxotdnispredirita^ DifiEeraxt polypeptide sizes 

wmobsared, probably due to inconqyletetranslatioiu Assuming 1bflt£ca/i and cUaroplast have 
similar protein syndesis machineiy, one could expect different levels of expression in transgenic 
tobacco chloroplasts dq)endiag on the regulatory sequeaices> vnih &e advantage that disulfide bonds 
5 are fonned in chloroplasts (31). Thesefourvectorswerebombardedintotob^ccoleavesviaparticle 
bonibardmeDt(6S) and after 4 weeks small shoots appeared as aresult of independent trausftumation 
events. llieydlweretestedbyPCRtoc^edcintBgiationinaeddmqi^ 
IGAandB. llie podthre denes wemtrans&tied to pols. T^^ 

blots showed different levels of egqunession depeaSng on die 5* region used in flic tcansfiomation - 
to vector. Maximimileveb were oteerved in tte plants transfonned with die 

lJTRofthepsbAg^eas^wnmFig.9. Quantification of the HSAandmolecularanalysisof these 
t ransfiiir^Ti^^tR are in progress^ 

1) Evaluation of chloroplast gene expression: A systematic qiproach to identify and 

1 5 pveccome potential hnutatioas of fbcetgn gene ejqnression in diloroplasts of transgemc plants is 
essential, haforniadon gained in lids study should incieasetlientiK^ of ^ 
system by scientists mtoested in cApiessing other fioreign proteins. Therefore^ it i& important to 
systematically analyze transcription, RNA abundance, RNA stability, rate of protein synthesis and 
degradation, proper folding and activity. For example, the rate of transcription of the introduced 

20 HS A gene will be compared with Ae highly expressing endogenous chloroplast genes (rbcL, psbA, 
16SrRNA), using run on transciq>tion assays to d^ennine if the 16SrRNApromoterisoperatingas 
expected Transgmc chloroplast oontaining each of the constructs wifli different 5* regions (see 
jMreluninaiy studies) win be investigated to test their transorq^ Sim]la]iy,trans^ne 
RNA levels win be inonitorad by northeins, dot blots and printer 

25 ibdU 16S iKNA or psbA. These results, along with run on tnmscriptian assays> ^uld provide 
valuable inJ&jrmation of RNA stability, processing, etc. With our past ©qjerieace in expression of 
several fordgn genes, RNA ^ipeaisto be extremelystablebasedonnordzonblc^analysi^ However, 
a systematic study would be valuable to advance utility of tiiis system by other scientists. Most 
importantly, die cffidencyoftrauQsIationwiD be tested in isolated^ andconq>ared with die 

30 highlytranslatedddoropIastpiOtBinCpsbA). Pulae chase e ac peritnefita will tielpftasegftiftranglatioiiftT 
pausing, premature temunation occurs. Evaluation of pocent RNA loaded on polysomes or m 
constructs widi orwithout STJIRs wiU hdpdetenninelhe efGcienc^of theribosomebmding site and 
5* 5ten>loop translational enhancers. Tsk our recent experience, we observed a 200-fold diSerence m 
accumulation of foreign protdns due to decreases in proteol^is confencd by a putative ch^>en)nin 

3 S (29) . Ilierefinre^ proteins from constmcts eqxressing or not expressing the putative ch^seronin (with 
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orwith0iitORFl+2)alioiiMp]x>vide valuable Tfaus^tfusmfinmatiQn 
win be used to mq)iove fhe next genemdon of cUoroplast vectozs. 

2) Expression of the mature protein: lESAisaprc-proteintfaatiscIeavedmtlieN-teiinma^ 
5 to seaete tbe mature proteiiL The codon for translation initiation is in the pzcsequence. In 
chloioplasts^ &e necessily of e)q>ressing the mature protein introduces this additional amino acid in 
the coding sequence. The seqnoceoflhematarepiotein is first subckmedb^jmiingw^ 
to optimize esqpiessLon levds. Subsequent inummological assays in mice are peillnm ed vnAx flie 
protein to investigate if the cxtra-metfaionine can cause immunognuc response or low bioactxvity. 
10 Alteinatively» different systems can produce tbe mature protdn. These systems can include the 
synthesis of a protein fused to a peptide diat is cleaved intracellulaiy ^irocessed) by diloroplast 
enzymes or the use of chemical or enzymatic cleavage after paitia] purification ofproteins firomplant 
cells. 

15 Useofp^fidestiiatarecfeavedincliloropiast: Staub^aLQl) reported chlmiplast expression 
of human somatotropin similar to fte native human protein by using iibiquitin fiisions that were 
cleavedinthestromabyanubiquitin protease. However, 1faeproce8singeffidencyrangedfix>m30 - 
80% and the cleavage site was not accuratew To process diloroplast expressed proteins a peptide 
which is cleaved in the stroma is essential The transit peptide sequaace of the RuBisCo (ribulose 

20 l,5-biqihoq)hatecaibox^ase) small subunit is an advantageous choice. This transitpq)tide has been 
studied in dq)th(66)..RnBisCo is oneoftheproteins that is syatliesized in cyt^ 
postranslationally into the chhxroplast in an energy dsspeaadsaai, process. The transit peptide is 
proteo]yttcallyremovediqpontranq>o^ There 
are several sequences described fiirdifl&nnt species (68). A transit pqrtide consensus sequence for 

25 the RnBisCosinaUsubunit of vascular plants is published by KeegstnietaL(^ 
that are proximal to the C-tetminal (41 - 59) are higUy conserved in theUgherp]^ 
andbelongtothedomainA^iichisinvolvedinen2ymaticcleavage(66). The RuBisCo small subunit 
transit pqptide has been fiised with various marker proteins (69^0), even with animal proteins 
C71,72)» to target proteins to die chlorcplast 

30 Prior to transfiamafion studies^ cleavage efBciency and accuracy is tested by m vitro- 

translation oftfaefiisionproteinaiid in cvsoT^^^ Qncetfae 
correct fi^<Mi sequence fi3rpn)ducing the irmtuieprotem is knowii, such sequence encoding the 
amino tcnninal portion of tobacco cUorppIast transit pq>tide is linked with the mature sequence of 
^protein. Codon CQnq)ositi(moftbetc4)acco RuBisCo small subuxiittran^ 

35 conq>atible with chloroplast optimal translation (see section 3 and Table on page 13). Additional 
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transit peptide seq;uaK:es for targeting and deav^e in flie chloToplast have been described (66). In 
cases wbace the RuBisCo small siibunit transit peptide is not suitable, other transit pqjtides with 
cleavage in stroma can be used The hmienofth^akoids is a good taig^ because tfaylakoids axe easy 
to purify. ttisreladvefyeasytofreeluinenalproteinsdtfaerbysonicatiQnorwilhaveryto 
S XI 00 concentratioa Howeyer, this often requires insertion of additional amino add sequences for 
efScient import (66). 

Useirf^chanicaloreiuymattccteayag^ Thestrat^offhsingaproteintoatagwithafBiiityfor' 
a certain Ugand has been used to oiaible affinity purification of reconihtnant pfoducts (73 - 75). 

10 However»scaleupofthistedinologyisusuallyqoitee;q)ensiye. Ayastnumberofcleavageinetfaods, 
both chraiical and enzymatic, have been investigated for this purpose (75). Chemical cleavage 
mediods have low specificity and die relatively haish cleavage conditions can result in chemical 
modificationsofdierdeasedprodttct5(75). Someof&eaD[zymaticn:icthodsof^dgnificaiidyhi^ia' 
cleavage specificities togetherwiflihigh efficiency, e. g. H64A subtilisin,IgApz0teaseand&ctor Xa 

IS (74J5), but these oa^mes have Aedrawba^ of bemg quite e9q>e3isivew 
l^typsin^whidideavesC-temiixial ofbask^anmo-^ 
to cleave fiisionpiotems (3,76). Despite expkted low spedfidty, trypsin ^ 
usefiil for specific cleava^ of fiision proteins, leaving basic residues within folded protein domaros 
uncleavaged (76). The use of trypsin only requires that the N-taminus of tiie mature protein be 

20 accessible to theprotease and that thepoterrdal internal sites are protected in the native confonnatiotL 
Trypsinlm&eadditioiialadvantageofbeinginexpensiveandreadilyavailable. hi the case of HSA, 
^rfienitwaseaqparesseding. co/iwitfa6additiona1codQnscodingforatrypsmcleava^ site, HSA was • 
processed su coessfiilly intp the noature protem after treatment with fee protease. Inadditio%tfaeN- 
terminal sequence was fi)und to be unique and identical to the sequence of natural HSA, ttie 

25 conversion was conqplete and no degradation products were observed (3). This in vitro maturation 
is selective because correcdy folded albumin is highly resistant to trypsin cleavage at inner sites (3). 
"niis system could be tested for chloroplasts HSA vectors using protein expressed i^ cott. 

Staub et al. (31) d^onstrated diat the chloroplast methionine aminopeptidase is acdve and 
Ihey found 95% of removal of the first mediionine of an ATG^somatotropin inotdn that was 

30 es^TBSsed via the chlorpplast genome. Thereareseveralinvestigatioiistfiathaveshownaveiystrict 
pattranofcleavagebytfai8pq[itidase(77). Mediionme is only removed when second residues are 
glycine, alanine, smne, cysteine, threonine, proline or valine, but if the third amino acid is proline the 
cleavage is inlubitDd. For HSA the second arninoacid is aspartic add, so the cleavage niay not be 
possible. 
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3) Optfmization of gene espres^m We liave reported Ibat fordgn genes are expressed 
between 3% (cry2Aa2) and 46% {cry2Aa2 (^eron) in transg^c cUoioplasts (26^9), Based on the 
outcome of &e evahiation of HS A chloroplast transgenic plants, several ^roaches win be used to 
enhance translation of die recombinant proteins. In diloroplasts, transcriptional regulation of gene 
expression is less important, aldiougji some modolations by light and developmental conditions are 
observed (78). RNA stability ^ipears to be one among the least problems because of observation of 
excesdveaccumulatimof fiKeigotransaipts^attin^ 16^66-foldhighartihantliehighlye]q3ressing 
niudeartransg)BQicplants(79). CUaniplastgq3eexpcessimisrBgulatedtoalai]gse3clBntatd^ 
transCTptional levd. For exaiiq[>le^ 5* are necessaiy for optimd translation of cU 
mRNAs. Shine-Dalg9ino(GOAGG) sequences as weU as a stemrloops^^ 
to the SD sequence are required for efficient translation. A recoit study has shown that insertion of 
flie psbA 5' UTR downstream of the 16S iRNA promoter enhanced translation of a foreign gene 
(GUS) hundred-fold (80). Tlicicfiwrc^ the 200-bp tobacco chloroplast DNA ftagment (1680-1480) 
containing 5' psbAUTRis used. This PCRproduct is tihen inserted downstream of the 16SrRNA. 
promoter to enhance translation of Ifae recombinant proteins. 

Yet another approach for enhancement of translation can be flie codon oonoposition 
optimizatioiL It is reasonable to cxpctii efficient expression in dilotoplasts since the protein is 
translated in E. colL Alfhou^ rbcL (RuBisCO) is the most abundant protein on earth, it is not 
translated as highly as the p^A gene due to the extremely higji turnover of the psbA gene product 
The psbA gene is und^ stronger selection for increased translation efficiency and is the most 
abundant thylakoid protein. Jn addition, the codon usage in higjbier plant diloroplasts is biased 
towards the NNC codcm of 2-fold degenerate groiq)S (i.e. TTC over TTT, GAC over GAT, CAC 
overCAT,AACoverAAT,ATCoverATT,ATA^.). This is in addition to a strong bias towards 
T at tfairdposittcn of 4-fbld degenerate groi^ There is also a context effect tiiiat should be taken 
into consideration while nu)difying specific codons. The 2-fold d^^iexate sites immediately 
upstream film a GNN codon do not show tibis bias towards NNC. (TTT GGA is preferred to TTC 
GGA while TTC CGT is jrofened to TTT COT, TTC AGT to TTT AGT and TIC TCT to TTT 
TCT)(81,82). 

Id addition, highly expressed chloroplast genes use 6NN more ftequoidy dian otho- genes. 
Tbe disclosure found in web site h[ttp'7/www Vftwij^ft,^,j|yi;>i;^^ was used to amlyze codon 
coiiq)osition by GoaQ)aringdi£feient species. Abundance of amino acids in ddoroplasts and fRNA 
anticodons present in c^oroplast was taken into consideration. We also coQqiaredA+T% content 
of all foreign genes that had been csqiressed in transgenic chloroplasts in our laboratory with the 
perccMta^ of chloroplast eaqaession. We found that higjio: levels of A+T always correlated wifli 
higji expression levels (see Table 1). The HSA sequence showed 57% of A+T content and 40% of 
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flie total codons matdied with the psbA most translated codons. Accoidixig to fte data of the Table 
and taking into considacation aU ftese &ctQrs» gooddiloioplast expression of the HSA gene witfioot 
niodifications in its codon con^sition can be e:q>ected. 
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4) Vector constnictifMis: pLD vector is used for the constructs. Ttds vector was devdoped 
for chloroplast transformation. It contains Ihe 16S rRNA promote (Prm) driving die selectable 
maiker gene oadA (aminoglycoside adenyl transferase conferring resistance to spectinomycin) 
followedbythej?ir£ui S'r^on (theterminatorfixHnagenecodingforphotosystemllreaction center 

20 o(Hiq)onents) fiom the tobacco chloroplast genome. The pLD vect(K is a univosal chloroplast 
euqxressioii /integration vectiv and can be used to transform chloroplast genomes of several other 
plant species (14^7) because these flanlring sequences are higjbly conserved among higher plants. * 
The universal vector uses tmA and tmT genes (chloroplast transfer RNAs coding Alanine and 
Isoleucine) &Qm the inverted rq>eat region of the tobacco chl(MX)plast g^ome as flanking sequences 

25 for homoio^us recombination. Because the universal vector integrates fordgn genes within the 
Inverted Rqpeatregion of the ddoroplast genome^ it doubles the cc^y mmiber of the transgene (fiom 
5000tol0»000copiespercellin tobacco). Furtfaennore^ithasbemdCTLoriusttateddiatharnopIam 
is achieved even in the iBrst round of selection in tobacco probably because of fte presence of a 
chloroplast origin of xq>]ication widim ^ flarddng sequence in flie urnvenal vector (thereby 

30 providing more templates fiirint^ration). Because of&ese arid several otiberieasoiiSyforcigpigBiKe 
cxpscssion was shown to be much higher when the universal vector was used instead of the tobacco 
specific vector (30). 

The following vectors can be used to optimize protein expressi on, purification and producti on 
of HSA with the same amino add composition as the human protein. 
35 a) W6iiicr6as6translationusmg&6psbA5*lJrRtooptin 

chloroplastDNA firagment oontaining 5* psbA is anq)lifiedby PCRusingtobaooo dJoroplast . 
DNA as template. Thisfisigmentis cloned directly in the pLD vector muMpledo^ 
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(EcoIU-Kc(d) downstres^ Hie cloned sequence is the 

same as in fte psbA gene. 

b) For enhaiKnngpiotdnstabiUtyand^iHtatiDgpur^ thirmgtensis 
operon derived putative cb^>en>nin is used. ExpressioQ of the arylAal operon la 

5 chloroplasts provides a model system for hyper-e;q>ressian of foreign proteins (46% of total 

sohible protdn) in a folded configoration enhancing their stability and fadlitatingporification 
(29). Thisjustifiesimdiision of fte putative diqien^^ 

the newly desigqedconstracts. Bittnsxegkm there are two cfien leading fiames(ORF 
ORE2) and a rnx)soma] binding site (lbs). Ibis sequence contains elements necessary for 

10 Cry2Aa2 ciystallizatiQn which he^ to fold or ctystalUze the HSA protdn belptng in the 

subsequent pnrificatioD. Successfbl cxystaDization of oth^ proteins using Ibis putative 
chaperomnhasbeendem(HistratBd(35). Wean^UfytbeORFl andORF2oftheBtC/y2/<a2 
opoonbyPCRnsingtbecQnqilete operon as tBiiq>late. The fragment is cloned into a PGR 
2.1 vecbff and exdsed as an EcoKItEcoRV product This fiagmeot is then doneddtrecdy 

1 5 into Ibe pLD vect(xrniultq>le cloning ^ (EcoRI-EcoRV) downstream of tbe promoter and 

the aadA gene. 

c) To obtain EBS A with Ibe same amino acid composition as fte mature human pcotetn (without 
Ibe extra metbionine), we first fiise HSA witb the RuBisCo small subunit transit peptide. 
Also> otii^ constructions are performed to allow cleavage of the piotdn after isolation fiom 

20 chloroplast 

The first set of constructs includes the sequence of HSA beginning with an ATG» introduced 
by PGR. using primers. (>nce(^)ttmale3q>ressionlevdsareacUevedi»andwfa«^ATGis^m 
to be a problem (deteimined by mice immonolpgical assays)^ processmg to produce the mature 
protdn is addressed. ThefirstatteiiqytistheuseoftheRuBisGDsmaUsubunittiansitpc^ This 

25 transit peptide is an^lified by PGR using tobacco DNA as a template and domed into ttie PGR 2.1 
vector. The HSA is fiisedwitb&etransitpeptideusingaMliiIrestrictionsitethat^ 
in tiiie PGR prim^ for aiiq)lificationofthe transit peptide and the HSA coding sequence. Thegene 
fusion is then inserted into die pLD vector downstream of tiie 5 'region that gives optimal esq^ession 
ofHSA(RBS,5Trn^sbA,ORFl+2,ORFl+2-5'lJTRpsbA). Ano&cr ^iproach to eliminate the 

30 ATGofthecodingregionistbeuseoftfieATObefiir6aproteaserecognitioQsequence»^ 

and remove fn vitro such extra sequence to obtain the mature protetrL Svch sequences will be 
introduoedbyprimersinaPGR. After conq>letii^ vector coiistructions^tfie vectors are sequenced 
to confirm correct nucleotide sequence and in fiamefiision. DNA sequencing is performed using a 
Pcrkm Ehncr ABI prism 373 DNA sequencing system or the like. 
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Because of the siniilarity of pEEOtem synthetic madiiiieiy (64), expression of cMoroplast 
vectors is first tested in £a7fibefinctbeir use Far Esdierichia coB 

expression XL*1 Blue strain is used PmificatKm and dcavago assays is pccfojmed using R coU 
10 Gsqitessedprotein. 

5) Bombardmmt, Regeneration and Charai^erization of Chloroplast Transgenic Plants: 
Tobacco (Nicotiana tabacum var. Petit Havana) plants are grown aseptically by gBmunation of seeds 
onMSOntediuoL lids mediuni contains MS salts (4.3 g^taO»BSvit8z^ 

15 100 mg^to^ Ifaiamine-HCa, 10 mg^itei^ mootinic add, 1 mgdite^ pyridoxine-HCl, 1 mgOiter), 
siK3ose(30gniter)andid]ytagar(6g/litBr)atpH5.8. Fully expanded, dark green leaves of about 
two monflL old plants are used for bombardment 

Leaves are placed abaxial side up on a Whatman No. I filter p^er laying on the RMOP 
medium(20)instandardpelripktes(100xl5nim)forbombanim^ Gold (0.6 ^m)microprojectiles 

20 are coated widi plasmid DNA (chloroplast vectors) and bombardments are carried out with the 
biolisticdevicePDSlOOQ/He (Bio-Ra^ as desaibedby Daniell (65)* Followingbombardmait,petri 
plates are sealed witii paiafifan and incubated at 24*^0 under 12 h photopedod. Two days after 
bcHnbardment, leaves are chopped into small pieces of mm? in size and placed (mllie sdection 
medium (RMOPooiitEining500 fi^ml of spectinDiiiymi dihyrimcWnritfe) witfi ahsmal safeiratrhTiig 

25 theiriedimDtndeep(100x25nmi)petriplates(~10piecesperplate). HieregeaieTatedspectinomycin 
resistant shoots are chopped into small pieces (~2mm^ and subcloned into firesh deep petri plates ('-S 
pieces p^ plate) containing same selection mediunL Resistant shoots fiom the second culture 
cyde are transferred to the rooting medium (MSO medium and spectinomycin dihydrochlaride, 500 
m^Iiter). Rooted plants are transferred to soil and grown at 26^C under 16 hour pbotoperiod 

30 conditions for iurdier analysis. 

PGR analysb of putative transformants: PCRisperfonnedusingDNAisolatedfiom control and 
transgenic plants to distinguish a) true chloroplast transformants fiom mutants and b) ddoroplast 
transformants firom nuclear transformants. Primers for testing the presence of the aadA gene (ttiat 
35 confers spectinomycin resistance) in transg^c plants are landed on the aadA coding sequence and 
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16S]fiNAg^e. One primer laoods on die aadAgen^wlifleahofl« 

genome as shown in Fig. lOA to test chloroplast integration of ttie genes. No PGR prodact is 
obtained with nuclear transgenic plants iising this set of primm. The primer set is used to test 
integration of tbe entire gene cassette widiout any internal del^ion or looping out during homologous 
5 recombination, Asiniilarstrat^hasbeenusedsuccessfullybyustocon&mcUorophsti^ 

of foreign genes (26-30). This screening is essential to eliminate mutants and nuclear tiansfonnants. 
Total DKA fionoiunbomibaided and transgenic plants is isolated as described by Edwards et aL (83) 
to conduct PGR analyses in transgenic plants. Chlon^last transgenic plants containing the desired 
gene are moved to second round of sdecdon to achieve hornoplasmy. 

10 

SouthernAnalysisforhomopIasmyandcopynnniber Southern blots areporformedtodetennine 
the copy number ofihe introduced foreign gene per cell as well as to test homoplasmy. Tbereare' 
severalthousandcopiesoftfaechloroplastgonomepresentineadipIantcelL Tlicreforc»iAen foreign 
genes are tns^ted into the cMoroplast genome» it is possible that ^me of the chloroplast genomes 

IS Imefordgn genes integrated while oQieis remain as the wild ty^ Therefore^to 
eiisnre that only the transformed genome exists m cells of transgenic plants (homoplasmy), fte 
selection process is continued. Total DNAfiom transgenic plants are probed widilfae chloroplast 
border (£Iankingi) sequ»ices (the tml-tmA fiBgment) to confirm that the wild type graiome does not 
exist at the end oftfae selection cycle. If wild type genomes are present (heteroplasmy), the native 

20 fragment size is observed along witii transformed genomes. Presence of a large ftagment (due to ' 
insertion of foreign genes within the flanlnng sequences) and absence of the native small fragment 
confirms homoplaany (26^7^0). 

The con^smnber of fte integrated gene is detmninedby estabfidnnghomc^lasmy forthe 
transgRnic chloroplast genome. Tobacco chloroplastsccnitain 5000-10,000 copies of tl^g 

25 perccill(27). When only a fiactionoftiiegmnies are actuaUy transformed, tliec^ 

default, must be less than 10,000. By establishing dxat in the transgenics the gene inserted • 

transformed genome is the only one presoit, it can be established that die copy number is about » 
5000-^10,000 p^ cell. TUs is usually doiie by digesting tiie total DNAwift a suitable restriction 
enzyme and probing with die flanking sequences tiiat enable homologous recombination into the 
30 chloropIastgqnomeL Ilienaftivefragmentpresentintbecontrolshouldbefli^^ 

The absence of native fis^ment proves that only tiie transgenic chloroplast genome is present in die 
G^ell and there is no native^ untransfoocmed, chloroplast graome^ without die foreign gene present 
This establishes tfaehomqplasmic nature of our transformants, simultaneously providing us with an 
estimate of about 5000-10,000 copies of the foreign genes per cell 
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NoitheniAiialyds for transcript stabfltty: NordteniblotsarepeifomiedtotestdieefBdeiicyof 
transcripticm of the genes. Tota] RNA is isolated fiom 150 mg of fiozen leaves by using flie'^Rneasy 
Plant Total KNA Isolation Kir (Qiageai Inc., Chatsworth, CA). RNA (1040 (ig) is denatured by 
formaldeihyde treatmeat,*sq)arated on a 1^% agarose gel in the presmce of formaldehyde and 
S transfmed to a mtrDcelIidosern£nibrane(MSQ as described in 

(HSA gene coding region) is labeled by the landom-primed method (Promeg?) with ^^-dCTP 
isotope. The blot is pre-hybridi2ed,faybiMi2ed and washed as descnb^ 
analysisL T^ansctqpt levels are quantified by the Molecular Analyst Ptogram naxig the GS-700 
Tmaging Dcnsitomctar QBio-Rad^ Hercules, CA) or Ac like. 

10 

Expression and quantification of tiie total protdn expressed in diloroplast: Chloroplast 
expression assays are perfinmed by Western Blot Recombinant protein levels in trarKgenic plants 
of first and second generation (To and Tl) are detennined using quantitative ELISA assays. A 
standard curve is gexieiatedusingknown conoentrations and serial dilutions ofrecomhinant and native 
IS proteins. DtOerent tissues are analyzed using youngs niature and oUleav^ 

C^TordicI[mnunology)antibod^e& Bound]igPisinfiasurBdusiiighorserBdishpen}xidas&-kA»^ 
goat Ig(j (Sigma). 

Inheritance oflntroduced Foreign Genes: While it is unlikely that introduced DNA moves from 
20 &e diloroplast genome to nuclear goiome, it is possible that the gene can be integrated in the nuclear 
genonie during bombaiximent and reniain unddected in Sout^ Th^fore» in initial 

tobacco transfonnants, some is allowed to sdf-pollmate» whereas otfa»s aroused in reciprocal crosses 
with ccmtrol tobacco (transgenics as fbmale accepters and pollen donors; testiiig for matemal 
inhegitanoe). Harvested seeds fTl ) are gE nnrnat eri oo media c nnt a ining qieiAuiniycin. Achievement 
25 of homoplasmy and mode of inheritance can be classified by looking at gemination results. 
Homoplasmy can be indicated by totally green seedling3 (27) while h^enyplasmy is displayed by 
variegated leaves (lack of pigmentation, 24). Lade of variation in chlorophyll pigmentation among 
progeny also underscores the absence of position efTect, an arti&ct of nuclear transformation. 
Maternal inheritance is dcmonstratedbysole transmission of introduced genes 
30 tran^gBmcplants»r^mdks8ofpoB»souice(gre6nseedlingsonsdectiv6m^^ Whentiansg^mc 
poUenisusedforpollinationofcontrolplantSyiesultantprogeiiy^ 

in selective media (will appear bleadied; 24). Molecular analyses confirm transndssidn and • 
expression of introduced genes* and T2 seed is g^erated fiom those confirmed plants by the 
analyses described above. 
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6) PDrffication methods: Thestaiidaidmetibiodofpfiizi£u»donea^ 

teckuques with the crystallized piotemsiiiside&^ Iti tins case, the homog^iiates are 

passed tfarough miracloA to remove cell debris. Centrifugation at 10,000 xg pell^ all foreign 
proteins (29). Ihrotdns are solubilized using pM, temperature gradi^ Tins is possible if the 
5 ORFl and 2 oftheciy2^a2operon can fold and crystallize the lecom When there is 

no crystal formation, other purification meOiods are applied (classical biochemistry techniques). 
Albumin is typically aduiiuisteied in teais of gram quantities> At apnrit^ level of99.999% (a level 
consideied snfficient far other reoombinani protein prqiarations), leconibinant HSA (iHSA) 
impurities on die order of one mg is still irgected into patients. Hence, inqnnities fiom the host 

10 organism must be reduced to a minimum. Fur&emore, purified rHSAimist be identical to Inmian' 
HSA. I)espitethesestringaitrequiranaits» purification cost5riu]Stbekq>tlow. Itisnotqypropriate 
to£^lyconventional processes for purifyiiig HSA originatirig mplasmatopfurify the HSA obtain 
by gene manipulation. This is because the impurities to be eliminated from rHSA diffiarfrpm those 
contained in the HSA originating in plasma. Nainely, iHiS A is ooiitBiiiinated witt, fix* exainple/ 

IS coloring matters diaracteristic to reoombiDant HSA, protons originating in Ae host cell5» 

polysaccharides, etc in partirailar it iaTiM^QgaTytACTiffiMtwtlyftKTi^matft rn^ nrigrnattng m 

Ae host cells, since they are foreign matters far living organisins inchidinghmnan andean cause die 
problem of antigenidfy. 

hiplants, two dififer^metfaodsofEISA purification faavebe^perfonned at lab^ 

20 SijmonsetaL(12)transfonriedpotatoandtDbacco plants witfa>4groZ>a^:ren Fordie 
extraction andpiuificationofHSA, lOOOgof strand leaf tissue was homogonzed in 1000ml cold 
PBS,Q.6%PW,0JniMFMSFandlmM£DTA. The homogenate was clarified by filtration, 
centrifogiBd and the snpematant incobated finr 4 h with 1^ ml polydonal antiHSA coupled to 
Reactigel spheres (Pien»C3iem) in die presence of0.5%Tween 80. The complex HSA-anti HSA--. 

25 Reactigel was collected and washed with S ml 0.5% TwecQ 80 in PBS. HSAwasdesorbedfi^omthe 
reactigel con^lex with 2.5 ml of 0.1 M glycine pH 2.5, 10% dioxane, immediately foUowed by a 
baSerexdiange widiSephadexG25to50ixiMTri5pH8. Thesa]iq>lewasdiQiloadedonaHR5/5 
MonoQ anion exchange colmrm (Pharmacia) and ehited widi a linear NaCl gradient (0 - 350 mM 
NaCl in 50 mM Tris pH 8 in 20 min at 1 ml/min). Fracti ons containing die concentrated HSA (at 29 0 

30 mM NaCl) were lyophiKzed and applied to a HR 10^30 Scpharose 6 cofamm (Phannacia) in PBS at 
03 ml/min. However, this mediod uses a£5nity columns (polyclonal anti-HSA) diat are veiy 
expensive to scale-up. Also, die protem is released fiom the column widiOJMgl^mepH 2.5 that 
^ically denatures the protein. Therefore, this method can be suitably modified. 

The second mediod is used fixr HSA retraction and purification fiom potato tubers (Dr. 

35 Mingo-Castel*s laboratory). After grinding die tuber in idiospbate buffer pH 7.4 (1 ragftad), die 
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homog^iate is filtered in nuraclolii and centiifiiged at 14,000 ipm 15 minutes. After ttiis stop, 
another filtration of the siq)erQatant in 0.45 fim filters is necessary. Then, chromatography of ionic . 
exdiangp in ITIX! using a DEAESepharose Fast Flow colunm(Am^ha^ Fractions 
recovered are passed through an afBnity cohmm (Bhie Sq)liarose fast flow Amersham) resulting in 
a product of high purity. HSA purification based on both methods can then be iavestigated. 

7) Characteriaation of the recombinant proteins: For the safe use of recradnnant proteins 
as areplacement in any of the current ^)plicaliaDS,tiiese proteins must be structurally equivalent and 
most not oQQtamahnoriiialfaost*dcnvedmodificati(His. To confirm compliance with Ibeso criteria 
human and recombinant proteins can be compared using the currently hi^y sensitiye and highly 
resolvmg techniques expected by the regulatory authorities to characterize recombiiiant products 
(85). 

1- Amti^ftaCT ^analvsis: N-tenxdnal sequence analysis isperfonned by Edman d^iradationusii^ 
ABI477ApiotBmsequeQoerwiaLanon-ljiie 120Aphen^tMohydantoinraniinoaddanalyzen 
Automated C-terminal sequence analyasuses aHewlett-Packard Gl OOgAprotan sequencer. 
The C-tenninal ti^tic pq>tide is isolated from tryptic digests by reverse-phase HFLC to 
Gonfinn die CMenninal sequence to a greater mimber of residues. 

2- Protein folding and disulBde bridges farmatton : Western blots with reducing and non- 
reducmg gels is performed to check protdn folding. Protein standards (Sigma) are loaded 
to conq)aie fee mobihty of fee recombinant protein, PAGE is perfoimed on PhastGek 
(Pharmacia Biotedi). Proteins are blotted and then probed wife goat anti-HSA antibodies. 
Bound^lQ is detected wife horseradish peroxidase-labelled and goat IgG and 

X-ray fihn usmg ECL detecdcm reagents (Amersham). 

3- ChramatographlctBchniqnesr PnrHSA^ anaTyrirail p>l^ermpgtffmHPT17tttpi»rfhiTfiRiliigmg 
a TSK G3000 SWxl cohmm. Prqiaratiye gel permeation chromatogR^hy of HSA is 
performed using a Sephaciyl S200 HR column. The monomer fiactioD, idrattified 
absoibance at 280 nm, is dialyzed and reconcentrated to its starting concentration. 

4- \^scositv: Tliis is a classical assay for recombinant HSA. ^\^scosity is a diaracteristic of 
proteins related directly to feeir size, shape, and conformation. The viscosities of HSA and 
recGmbinantHSAarexneasuredatlOOmgMlrlmOJSMNaausingaU-tttbeYi^ 

^ type, Foulton, Selfe and lid, Essex, UK) at 2SX;. 

5- Glvcosviation : CMirpplast|TOteinsaTenotknowntobeg)ycosjdated. However feere are 
no publications to confirm or refote this assumption. Therefore, giycosyiation will be 
measured using a scaled-up veraon of fee mefeodof Ahmed and Furfe (86). 
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8) Animal testily and PreCHiilcal Trials: Wheaalbumm is inroducGd at adequate le^ 
tobacco and IfaephysicbchCTiical properties of the product conespond to those of dienatoral protein, 
toxicology studiesneed to bedonetQinice. To avoidmicere^onseto the faumanprotein, transgenic 
mice carrying HSA genoniic sequences is used (87). Aftw injection of none, 1 , 1 0, 50 and 1 00 mg 
of purified recombinant protein, classical toxicology studies are carried out (body weight and food 
intake, animal behavior^ piloerection, etc.). Albumin can be tested for blood vohmie replacement 
afi^ paracentesis to efiiniiiate the fhiidfiDom the peritoneal cai^ It 
basbcca shown that albumin infusion after this nianeuvcr is essential toprescrve effectived 
volume and tcnal function (88). 
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EXPRESSION OF HUMAN THERAPEUTIC PROTEINS 
IN TRANSGENIC TOBACCO CHLOROPLASTS 

BACKGROUND OF THE INVENTION 
5 FIELD OF THE INVENTION 

The presoit inventioii is directed to tiie e^qxression of genes in plants to produce 
lecombiDant pEOteins* 

DESCRIPTION OF RELATED ART 
Research cm human pzoteins in the past years has xevohitioaized the use of these 

10 tiiei^eatically valuable proteins in a varied of clinical situations^ Since ^ danand jS»r these 
proteins is expected to increase ccHisiderably in tiie coming yeais^ it wo 
in tibte future they wiB be available in sigui&antly larger amounts, preft^ably on a cost-efEective 
basis. Because most genes can be expressed in many different systems, it is to 
detemnne ¥4iich system offers the most advantages for the manu&cture of fiie recombinant 

15 protein. The ideal expression system would be one that produces a mannnun amount of safe, 
biologically active materia] at a minimum cost The use of modi&d mammaliaa cells with 
recombinant DNA techniques has die advantage of resulting in products vriiich are closely 
related to those of natural ocigm; however, cnltnring of these cells is intricate and can cxily be . 
earned out on limited scale. The use of microorganisms such as bacteria permits manufacture on 

20 a larger scale, but introduces Hie disadvantage of producing products^ which dififer ^reciably 
firom products of natural origin. For example, proteins that are usualfy g^cosylated in 
humans are not glycosylated by bacteria. Furthemoore, human iMX>teins that are expressed at high 
levels in K coU firequen^ acquire an u n uatui al confomiation, accon^panied by intcaoelhilar 
precipitation due to lack of pn^ier folding and disulfide bridges. Production of recombinant 

25 proteins in plants has many potential advantages fixr generating biopharmaceuttcals relevant to 
clinical medicine. These indnde the £:»Uowing: (1) plant systems are more economical dian 
industrial fedlities using fomentaticMi systems; technology is available fat harvesting and 
[HXicessing plants/ plant products on a large scal^ (iii) eliminatian of die purification reqimement 
when Ibe plant tissue containing die recombinant protein is used as a food (edible vacdnes); Qn) 

30 plants can be directed to target protems into stable, intracellular campaitments as chlon^lasts» or 
- expressed direcdy in chloroplasts; (v) the amount of recombinant product that can be produced 
approaches industrial-scale levels; and (vi) health risks due to contamination with potential 
human pathogens/toxins are minimized. 

It has been estimated that <me tobacco plant should be able to produce more recombinant 

35 protein ttmn a BCX^liter fermenter of K coiL In addition, a tobacco plant produces a miUion 
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ing laige-scale production. Tobacco is also an ideal choice because of its relative 
ease of geiietic manipulation and an in^>endiag need to explore alternate uses for this hazardous 
crop. However, wi1h die exception of enzymes (e.g. phytase), levels of fittreiga proteins produced 
in nuclear transgpnic plants are generally low, mostly less than 1% of the total soluble protein 
5 (1). May et al. (2a) discuss this problem using the followmg examples. Although plant demed 
reco mbin a nt hq[mtitis B sut&ce antigen was as effective as a commercial recombinant vaccine, 
the levels ofexfxcs^m in transgenic tobacco were low (0.0066% of total sofaible protein). Even 
thou^ Norwalk vims capsid pn^ein e^qsessed in potatoes caused oral immunizatioh when 
ccmsumed as food (edible vaccine), expressicxi levels were low (0.3% of total soluble pioteinX Bi 

10 particulBr» eqsressiQn of human protdns in imdear tcansgenie plants has been dis^ifpointing^ 
low: e.g. human Interferon-D 0.000017% of fiesh weigjbt, hmnan s^um albumm 0.02% and 
eryduropoietin 0.0026% of total sohible protein (see tablel in refl). A synthetic gene coding for 
the human q)idennal growdi &ctor was oqnessed only up to 0.001% of total soluble protein in 
transgenic t(d)acco (2a). The cost of producing recombinant proteins in alM& leaves was 

15 estimated to be 12-fold lower dian in potato tubers and canq)arable with seeds (1). However, 
tobacco leaves are much larger and have much high er biomass than alfidfiL The cost of 
production of recombinant proteins will be 50-fdd lower dian dmt g£KcoU fermftnfa»ii'f»n (wi& 
20% ejqxressioii levels^ 1). A decrease ht insulin expression fiom 20% to 5% of biomass doubled 
the cost of production (2b). Esqwession level less than 1% of total soluble protein in plants has 

20 been fomid to be not ecMnmercially feasible (1). Therefore, it is important to inoease levels of 
expression of recombinant proteins in plants in order to ejqiloit plant production of 
pharmacologically inq>ortant proteins. 

An alternate approach is to ecqxr^ foreign iHX)tein3 in chloroplasts of higher plants. We 
have recently int^rated foieiga genes (iq> to 1 0,000 copies per cell) into die tobacco chloroplast 

25 gpoome resnltmg in accnmolation of recombinant proteins 1^> to 47% of the total celhilar protein 
(3). Chloroplast tiansfoiniation utilizes two flanking sequences diat, tfarougji homologous 
recombinatioi^ insert foreign into die spacer region between die functional genes of the 
chloroplast genome, dms targeting the foreign genes to a precise location. Hiis eliminales the 
''position effectf* and gene silencing firequendy observed in imclear tnmjt^py^ip. plantSL 

30 Chloroplast genetic engtneenng is an environmentally fiiendly q[y[»oach, mmimiTiiig concerns 
ofout<n)ssofintroduced traits via polloi to weeds or other crops. Also, the concerns of insects 
devdoping resistance to biopesticides are minimized by hyper-expression of single insecticidal 
proteins (hig^ dosage] or expression of different types of insecticides in a single transformation 
event (gene pyramiding). Concerns of insecticidal proteins on non-target insects are minirriized 

35 by lade of expression in transgenic poll^ Most inaportantly, a significant advantage in the 
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phamiaceDtical protems in chloroplasts is their ability to process eukaiyotic 
proteins, inchiding folding and fbnnation of disulfide bridges (4). Chaperonin proteins are 
present in chloroplasts (5,6) that function in folding and assembly of prokaryotic/eukaiyotic 
proteins. Also, pioteiiis are activated by disulfide bond cxidQ/reduction cycles using the 
5 chloroplast thioredoxin system (7) or chloroplast protem disulfide isommse (8). Accmnulation 
of fully assembled, disulfide banded fonn of faun^ somatotropin via chloroplast transfbnnation 
(9) and obgomeric &am of CTB (10) and assembly of heavy and li^ chains of faumaniised 
Ooys 13 antibody in transgenic chloroplasts (11) provide strong evidence successful 
processing of pJiannaoeutical pioteios inade ddon^lasts. Sudi fbldii^ and assembly should 
0 dmiinate the need fir highly eoqiensive in vitro processing of phannaceotiical protems. For 
esmo^e, 60% of die total pperatmg cost in die production of famxi^ 

processing (facmation of disulfide bridges and cleavage of methionine)^). 
Taken togc^her^ low levels of e^qxressioD of human proteins in nuclear transgenic plants, 
aiKl difficulty in fohhng, assembly/processing of human proteins in E.coli should make 
5 diloroplasts an ideal compartment far esqnession of these proteins; production of human proteins 
in transgenic diloroplasts should also dramatically lower the produdiQn cost Large-scale 
producticm of diese proteins in plants should be a powerfol approach to provide treatment to 
patients at an affordable cost and provide tobacco £anners alternate uses for this hazardous ccop. 
Therefore, we propose here expiessicm of therapeutic proteins m transgenic tobacco chlcnoplasts 
0 to increase levels of eiqiression and accomplish i>i Wvoprocessmg, 

BRIEFDESCRIPTION OF THE FIGURES 
Figure 1 is a graphical rqxresentation of total protem versus leaf age in transgenic tobacco 
plants. 

Figure 2 is an dectron microgrq^ showiiig Ciy2Aa2 crystals in a traiisgfraiic to 
5 Figure 3 is aphotpgrai^ of leaves infected with P. syringae 5 days after inocnlatioiL 
Figure 4 is a gr^;^showiiig the results ofan in vitro assay of P. aentgptasa. 
Figure 5 is two giaidis showing oligomeric CTB expression levds as Total Soluble Protein. 
Figure 6 is a Westenx Blot Analysis of transgBoic chbroplast expressed CTB and commercially 
available purified CTB antig^ 
0 Figure 7 is a Western Blot Analysis ofheavy and light diams of Guy's 13 monoclonal antibody 
firom plant chloroplasts. 
Figure 8 is a Western Blot of transgenic potato tubers^ cv Desiree. 
Figure 9 is a fiequ^y histogram including percentage K^ebec and D^ir^ transg^ci 
plants esqpressing different HAS levels. 
5 Figure 10 is a Western Blot of^Lo)/! protein extracts. 
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HUMAN SERUM ALBUMIN 

HSA is a monomelic globular protein and consists of a single, gec^rally nonglycosyiated, 
polypqrtide chain of 585 amino adds (66.5 KDa and 17 disulfide bonds) v(i&x no 
postranslational modifications. It is composed of ttnee structurally similar globular domains and 

5 the diadfides are positioned in repeated series of nine loop-link-loop stmctures centered around 
eight sequential Cys-Cys paiis. HSA is initially synthesized as pre-pro-albumin by the liver and 
released from tiie endoplasmatic reticulum after removal of the aminotenninal prq>eptide of 18 
amino adds. The pro-albumia is further {accessed in the Golgi ccrnqdoL where Ifae^ oOier 6 
aminotemiinal residues of the fxropeptide are deaved by a serine protdnase (12). This results in 

0 tiie secretion of the mature po^peptide of 585 amino adds. HSA is encoded by two codonmiant 
autDBomic allelic genes. HSA bdong? to the multig^e &mily of proteins ihat include atphsL- 
fetoprotem and human gcoiqnqpecific component (Gc) or vitamin D-binding &mily. HSA 
&dhtates transfer of many ligands across org^n circulatory int^i^ces such as in the liver» 
intestine, kidney and brain. In addition to blood plasma, scrum albi^min is also found in tissues. 

5 HSA accounts for about 60% of die total protein in blood serum. In die serum of human adnlts» 
the concentration of albumin is 40 m ^fml. 

The primary fimctian of HSA is to maintenance of coUoid osmotic pressure (COP) 
within the blood vessels. Its abundance makes it an inqxutant detenm'nant of die 
phannacddn^c behavJor of many drugs. Reduced synthesis of HSA can be due to advanced 

0 liver disease, inqpaired intestinal absorption of nutrients or poor mitritional intake. Increased 
albumin losses can be due to kidney diseases (increased glomerular permeability to 
macromolecules in the nephrotic syndrome), intestinal diseases (protdn-losing enteropadiies) or 
exudative skin disorders (bunis). CataboHc states such as dnonic infecticms, sepsis, surgery, 
intestiikal resection, trauma or extensive bums can also cause hypoalbuminemia. HSA is used in 

5 ibaapy of blood vi^nme disocdera, fiir eKanq>le posthaemonhagic acute hypovolaemia or 
extensive boms, treatment of ddiydiation states, and also for dnhotb and hepatk iOnesses. It is 
also used as an addittve in pei&sbn Uqmd fi)r extcacoipored drcuhtioi^ 
for repladi^ blood volume, but also has a variety of non-dierq^eutic uscs^ including its role as a 
stabilizer in frxmmlations foor other therapeutic proteins. HSA is a stabilizer for biological 

D materials in nature and is used for preparing biological standards and reference materiala 
Furthermore, HSA is frequent^ u^ as an experimental antigen, a cdl^KsiilturB constitnent and a 
standard in clinical-chemistry tests. 

The e7q>ression and purificadon of recombinant HSA from various microorganisms has 
been reported previously (13-17). Saccharomyces cerevisiae has been used to produce HSA both 

5 intracellulazy, requiring denatnration and refokhng prior to analysis (18), and by sec^tion (19). 
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was eqiuvaleiit stracturally, but the recombinant product had lower levels of 
egression (recoveiy) and stroctuial heterogeneity conq>ared to ^ blood derived protein (20). 
HSA was also expressed in Kluyveromyces lactis, a yeast with good secretary properties 
achieving 1 g/liter in fed batch cnltnres (21). Ohtani et al (22) developed a HSA expression 
5 system using PiMa pastarls and establidied a puriflcatian method obtaining recombinant 
protein witii stmilar levels, of purity and properties as the Mman protein, in Bacillus subtilis, 
HSA could be secreted using bacterial signal pqytides (15). HSA production m K edit was 
successful but required additioanal m vitro processing with trypsin to yield the mature pio t eia 
(14). Sijmons et aL (23) expressed HSA m transgenic potato and tobacco plants. Pkision of HSA 

10 to the plant PR-S pssequence resulted in deavage of ttio presequence at its natural sfce and 
secretion of conecdy processed HSA, that was indistinguishable fipom the authentic human 
proteiiL The expression was 0.014% of the total soluble proton. However, none of these method 
havebeen e;q)loited commercially. 

Albumin is currently obtained by protein fiactionation fi:om plasma and is the world's 

15 most used intravenous protein, estimated at around 500 metric tons per year. Albumin is 
administered by intravenous injection of sohitioi^ containing 20% of albumiiL Hie average 
dosage of albumin for eadi patient varies between 2(M0 grams/day. The consumpticm of 
albumin is around 700 kilograms per million habitants per year. In addition to the hig^ cost, HSA 
has the risk of transmittmg diseases as with other blood-derivative |xodoct5. The price of 

20 albumin is about $3.7/g. Thais, ^ market of this protem ap pr oxim ately amounts to $ 2,600,000 
per million people per year (0.7 bilHon dollars per year in USA). Because of the hi^ cost of 
albumin^ synthetic macramolecules (hke dextrans) are used to increase plasma coUoidosmotic 
pressure. 

Commercial HSA is mainly prepared fiom human plasma. Hus source^ hardfy meets die 
25 requirements of tbe worid madD^ The availability of human plasma is limited and careful heat 
treatment of the product prepared must be performed to avoid potential oontaminatioa of the 
product by hqntitis, HIV and odier viruses. The costs of HSA extraction fiom blood are very 
higjL In order to meet the demands of ^ large alburnin market with a safe product at 
iimovBtivc production systems are needed Plant biotechnology ofTers pronnse of obtainiiig safe 
30 and cheiq) proteins to be used to tteat human diseases. 

INTERFERON ALPHA 
Mt^^ns (IFNs) constitute a heterogeneous family of cytokines witii antiviral, 
antigrowth, and inmmnomodulatory properties (24*26). Type I IFNs are acid-stable and 
35 constitate tibe first line of defence against viruses, both by difiplaying direct antiviral ejects and 
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with ttie cytokine cascade and tlie iimnime system. Their fimction is to induce 
regulation of growth and differentiation of T cells. The human IFK-hx femily consists of at least 
22 intronless genes, 9 of which are pseudogenes and 13 expressed genes (sabtypes) (27). Human 
IFN-a genes encode proteins of 188 or 189 amino acids. The first 23 amino acids constitute a 
5 signal peptide, and the oth^ 165 or 166 amino acids fomi die mature protein. IFN-a subtypes 
show 78-94 % homology at the nucleotide leveL Presence of two disuMde bonds between C^s- 
l:Cys-99 and Cys-29:Cysl39 is conserved among all IFN-a species (28). Human IFN-a gpnes 
are egq;xressed constitutiyely in organs of normal individuals (29^0). LadLvidoal IFN-a g^es aie 
dififeccotiy expressed depending cm the stmmhis and ttey restricted cell type expression 
10 (31). Ahhoagti all IFN-a subtypes bind to a common rccqitor (32), several iepc»1s saggGst tibat 
th^ show quantitodvety distmct patterns of antiviraU gmwtti i^ 

activities (33). JFH<iB and IFN-a5 seem to have ttie greatest antiviral activity in liver tumour 
cells HuH7 (33). IFN-oS has^ at least, the same antiviial activity as IFN-ot2 in m vitro 
esqpedmenis (unpublished data m Dr. Prieto^s lab). It has been shown recently ttmt IFN-oS is the 

15 sole IFN-a subtype esqiressed in normal liver tissue (34). IFN-ctS expression in patieiKts with 
cfaronic hepatitis C is reduced in the liver (34) and induced in mononuclear cells (35). 

Interferons are mainly known for their antiviral activities against a wide q>ectnmi of 
viruses but also for their protective role against some non-viral pathogens. They are potent 
imnmnomodulators, possess direct antiprolifciatiYe activities and are cytotoxic or cytostatic for a 

20 nrnnber of different tumour cdl types. IFN-a is mainly employed as a standard therapy for hairy 
cell leukaemia, metastasizing carcinoma and AIDS-associated angiogenic tumours of mixed 
cettnlaxrty known as kapoai sanunnaa. It is also active against a number of other tmnouxs and 
viral infections. For munple» it is ^ cunent sq)proved fbiesspy for cinonic viral hqpatitis B 
((IHB) and C (CHC). The IFN-<x subtype used for chrome vi^ 

25 of patients with CHB and about 25% of patients with CHC respond to this therapy with susiained 
viral clearance. The usual doses of IFN-a are 5-10 MU (subcutaneoi2S injection) tiuee days per 
wedc for 4-6 months for CHB and 3 MUdireeckys per week for 12 months 
of IFNa2 r^xresent approximatety 15 Dg of recombinant proteirL The respoi^ fate in patients 
with dmmic hq;)atitis C can be increased by combining IFN-a2 andnbaviiiiL This combination 

30 ther^, ^Addk considerably increases the cost of the therapy and causes some additional side 
effects, results in sustained biochemical and virological remission in about 40-50% of cases. 
Recent data suggest tiiat pegilated interferon in weekly doses of 180 Dg can also increase the 
sustained re^ion&e rate to about 40%. IFN-a5 is the only IFN-a subtype expressed in liver; this 
expression is reduced in patients with CHC and IFN-a5 seems to have one of the highest 
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in Hv^ tmnour cells (see above). An intemadGnal patent to use IFN-<x5 has 
been filed by Prieto's group to facilitate commercial development (36), 

Hnman interfenHis are cunren^ prepared in microbial systems via recombinant DNA 
technology in amounts which camiot be isolated from natural sources (leokocytes^ fifaroblastSy 
5 lyn^ihocytes). Different recombinant interfeKQn-D gj»ies have been cloned and expressed in K 
coll (37a,b) or yeast (38) by several gracq)& Generally, the synthesized piotein is not correctLy 
folded due to the lack of disnlfide hddges and tiserefore, it remains insoluble in inclusion bodies 
that need to be sohibilized and refolded to obtain the active int etfeton (39,40). One of fhe most 
efficient methods of intafisEon-D eatpression has been published recently I^BabuetaL (41). bi 

10 tihismetfaod,£l co/ioe&tiansfoaned with interferon vectors (regulate 

promoters) were grown m high ceU density cultures; this resulted in the production of 4 g 
interfeion-Q/Iiter of culture. Expression resulted exchisively in the jfonn of insoluble inclusiCMi 
bodies which w^ sohibilized under doiatunng conditions, refolded and purified to near 
homogeneity. The yield of purified interferon-^D was approximately 300mg^ of culture. 

15 B7q[irBssion in plants via the nuclear genome has not been very successful. Smimov et al. (42) 
obtained tiansfoimed tobacco plants with Agrobactertum tunufadens using the in t e rferon-D 
gene under 35S CaMV promoter but the eoqxression level was very low. Eldelbaom et aL (43) 
showed tobacco nuclear tranaf<Hmadon whfa Inter fe ron-D and die ^spression level detected was 
0.000017% of fiesh weight 

20 The number of subjects infected with hepatitis C virus (HCV) is estinnated to be 120 

million (5 million in Europe and 4 million in USA). Seventy per cent of the infected people have 
abnormal liv^ function and about one timd of these have severe viral hepatitis or cirrhosis. It 
might be estimated however diat there are about 10,000-15,000 cases of dnraic infection widi 
hq>atitis B virus (HBV) in Europe^ a slig}idy lower munber of cases in USA. la Asia the 

25 prevalence of chronic HCY and HBV infection is v^ higji (about 110 million of people are 
infected by HCV and about 150 mimons are infected l^r HBV). In Afiica H 
prevalent Since ui3r«nittmg chroruc viral hqiatitis leads to Hver dnfaosis and eve^ 
cancer, the fai^ prevalence of HBV and HCV infection in Asia and Afiica gf^^wryti? for Iheir 
vary high incidence of hepatocellular carcinoma. Based on these data, the need fer IFN-a is 

30 laige. WN-a2 is cutrently produced in microorganisms by anumber of conqianies mi Ae price 
of 3 MU (15 Dg) of recombinant protrin in tiie western market is about $25. Thns» the cost of 
one year IFN'<x2 Iher^y is about $ 4,000 per patient This pice makes this product unavailable 
for most of the patients in tiie wodd suffering from dironic viral hepatitis. Clearly methods to 
produce less e3q[>ensive recombinant piotdns via plant biotechnology irmovations would be 
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s anthira] therapy widely available: Beddeg^ if IFM-oS is more ejBBcieiit than ii*N- 
oi2» tower doses may be xequired. 

INSIJLIN-LIKE GROWTH FACTOR-I aGF-1) 

5 The InsQliiKlike Growth Factor protein, IGF-I, is an anabolic hozmome wi& a complex 

maturation process. A siogte IGF-I gene is transcribed into several znRNAs by alternative 
splicing and use of difTerent tianscr^stion initiation sites (44-46). Depending on die choice of 
SEriidng, two immature proteins are produced: IGF-IA» cxptcssoA in several tissues and IGF-IB, 
mostly expressed in liver (45). Bodi pre-proteins produce the same matnre protem. A and B 

0 iiTiHiature fimns have diffcreuL lengths and oon^x)sitKQn» as llieir termini are modified post- 
translationany by glycosylation. However, these ends are processed in the last step of matoiatiQn. 
Mature IGF-I protein is secreted, not g^lycosylated and has three rtianlfirtfi bonds^ 70 amino iKnds 
and a molecular w^g^ of 7.6 kD (47-49). Physiologically, IGF-I expression is induced by 
growth hormone (GH). Actually, the knodc out of IGF-I in mice has shown that several functions 

5 attributed originally to GH are in &ct medfatPd by IGF-L GH production by ad^hypofisis is 
repressed by feed=badc inhibrtion of IGF-L GH induces IGF-I synthesis in different tissues, but 
mostly in liver, where 90% of IQF-I is produced (48). The IGF-I leceptoi is expressed m 
difTerent tissues. It is formed by two polypeptides: alpha that interacts with I(ff-I and beta 
invc^ved m signal transduction and also present in the insulin receptor (50,51). Thus, IGF-I and 

} insulin activation aresnnilar. 

IGF I is a pot^ nrultifianctional anabolic hormone produced in the liver upon 
stimulation by growth hormone (GH). In livar cirrhosis the reduction of recqstors for GH in 
hepatocytes and the diminished synthesis of the liver parenchyma cause a progressive fell of 
serum IGF-I levels. Patients with liver cirrhosis have a rmmber of systemic denangements such 

S as muscle atrophy, osteopenia, hypogonadism, protein-calorie mafamtritian which could be 
related to reduced levels of circulating IGF-L Recent studies from Pricto*s hboratoiy have 
demonstrated that treatments with low doses of IGF-I induce significant improvements in 
nutritional status {52% intestmal absorption (53-55), osteop^iia (56)» hypogonadism (57) and 
liver fimdion (58) in rats with experimental liver cirrhosis. These data support that IGF-I 

) d^dency plays a padiogauc role in several systemic complications occurring in liver cirthosi& 
The liver can be considered as an endocrine gland synthesising a hormone sudi as IGF-I with 
important physiological fimctions. Thus liver cinhosis should be viewed as a disease 
accompanied by a hormone deficiency syndrome for which r^laccment therapy with IGF-I is 
warranted. Clinical studies arc in progress to ascertain the role of IGF-I in the management of 

3 cirrhotic patients. IC^-I is also being currently used for Laron dwarfism treatmoot These 
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liver GH receptor so ICT-I is not e?spressed (59). Also IGF-I, acting as a 
hypogtycemianl, is givea togetlier with insulin in diabetes mellitus (60,61). Anabolic e£^ts of 
IGF-I are used in osteoporosis treatment (62,63) hypeicatabolism and starvation due to burning 
and HIV infecti(Hi (64^65). Unpublished studies indicate that IGF-I could also be used inpatients 
5 with aiticalar degenerative disease (osteoarthritis)L 

Hie potency of IGF-I has enoouiaged a great number of scientists to try IGF-I esqsiession 
in vaxious micnxxgffnisms due to the small amount present in human plasma. Production of IGF- 
I in yeast was shown to have seveial disadvantages like low fomentation yiehis and of 
6btain]ng.undesiTabIe g^jyoos^atLon in these molecules (66). Esqxressxoii in bacteria has been die 

10 most successfoi qiproach, eidier as a secreted form fiised to protein leader sequences (67) or 
fused to a solubilized afiEnity fusion parotem (68). In addition, IGF*I has been produced as 
insoluble inclusion bodies fused to protective polyp^tides (69). Sun-Ok Kim and Young Lee 
(70a) e^^ressed IGF~I as a truncated beta-galactosidase fusion protein. The final puri&ation 
yielded ajpcoximately 5 mg of IGF-I haviug native conformation per liter of bacterial culture 

15 IGF4 has also been expressed in animals. Zinovievaetal. (70b) rqxirted an expression of 0.543 
mgAnl in rabbit milk. 

IGF-I drcnlates in. plasma in a &iily hi^ concentration varying between 1 20-400 ng^ 
In cirrhotic patients the values oflCSB-l £ill to 20 iigAnl and firequentty to undetectable levels. 
Replacement thei^ with IGP-l in liver dnhosis requires adrmnistration of 1 .5-2 mg p&r day for 

20 each patient Thus, every cirrhotic patient will consume about 600 mg per year. IGF-I is 
curroitly produced in bacteria (71). The high amount of recombinant protem needed fra: IGF-I 
replacement tiber^y in patients with liver cirrhosis will make this treatment exceedingly 
e?q[>ensive if new metiiods for che^ {Koduction of recombinant proteins are not devel(^)ed. 
Besides^ as described above, IC^-I is used in treatment of dwarfism, diabetes^ osteoporosis, 

25 starvation and hypercatabohsm. IGF-I use in osteoarthritis is currently being investigated. 
Again, plant biotechnology could provide a solution to make economically feasiUe the 
plication of IGF-I tiierqyy to all these patients. 



30 SUMMARY OF THE INVENTION 

The present invention develops recombinant DNA vectors for enhanced expression of 
human serum albumin, insQhn-hke growth &ctor I, and interfenm-0 2 and 5, via 
chlon^last genomes of tobacco, 
optinuzes processing and punficaticm of pharmaceutical proteins using chloxoplast vectors in K 
35 coli, and 
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_ aiic tobacco plants. 

Hie transgraiic eTq^ression of proteins or fusion protdns is charactezizBd nsing molecnlar and 

biochemical methods in diloroplasts. 
Existing or modified methods of porLfication are en^loyed on transgenic leaves. 
5 Mendelian or maternal inheritance of. transgenic plants is analyzed. 

Laige scale purification of ^isstapctx&c proteins from transgenic tobacco and conxparison of 

cunent purification mediods mE,coU oz yeast is perfomed^ and 
natinal refolding in clilaroplasts is con^^ 

CMnparison/charactCTiyatfon ^eld and purity) of fterapeutic piDtein s produced in yeast or 
10 Ka^ wilh transgenic tobacco chloroplasts is pezfimned, as are 

In vitro and in vivo Qxre-clinical trials) studies of protein biofimcttonality. 

DETAILED DESCRIPTKm OF THE INVENTION 

When the concq>t of chloroplast genetic engineering was dcvdc^cd (72>73)» it was 
1 5 possible to introduce isolated intact chloroplasts into protoplasts and regenerate transgemc plants 
(74). Therefor^ eariy inyestiga^tions on chloroplast transfonnation focused on the developmait 
^ of xhorsBReOb systems using intact (^oro 

and translation (75-77) and expression of foreign genes in isolated chloroplasts (7R). Howev^^ 
afier the discoveiy of Ifae gene gun as a tiansfnnnatian device (79), it was possible to transfcon 
20 plant dbloroplasts without die use of isolated plastids and protoplasts. CEloroplast gen^c 
engineering was accomplished in several phases. Transient expression of finreign genes in 
plastids of dicots (80»S1) was followed by sudi studies in monocots (82), Umqne to the 
chlorc^last ^netic engineering is tibe devel<^»nent of a foreign gene esqxression system using 
autonomously replicating chloroplast expression vectors (80). Stable integration of a sdectable 
25 marker graie into tlie tobacco ddoroplast genome (83) was also accoiiq)Iished using the gene 
gun. Howevrar, useful genes conferring vahuible traits via cMoroplast genetic enginemng have 
been demonstrated only leoendy. For mniple, plants resistant to B.t sensitive insects were 
obtained by integrating file GryC4c gene into the tobacco chloroplast genome (84). Plants lesistaDt 
to B.t resistant insects (iq> to 40,000 fold) were obtained by hyper-eaqnession of the ay2A gene 
30 widnn die tobacco chlorpplast geninne (85). Plants have also been gpnekicaUy eogineeved via the 
chloroplast genome to confer herbicide resistance and the introduced foreign genes were 
maternally inherited, overcoming the problem of out-cross with weeds (86). Chloroplast genetic 
engineering technology is currratly being ^lied to other useful crops (73,87). 

A remarkable feature of chloroplast genetic engjnerang is flie observation of 
35 exceptionally large accumulation of foreign proteins in transgenic plants, as much as 46% of 
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n total soluble protem, even in bleached old leaves Q). Stable expression of a 
pbaiinaceulical protein in chloroplasts was first cqx>ited for GVGVP, a |HOtem based polymer 
with varied medical appHcations (such as the prevention of post-surgical adhesions and scars^ 
wound coverings^ artificial pericanha, tissue reconstruction and piogranmied drug delivery (88)). 
5 Subsequently, expression of the human somatotropin via the tobacco chloroplast genome (9) to 
high levels (7% of total soluble protein) was observed. The following investigations that are in 
{sogress m the Darnell laboratory iUustiate die power of this tedmology to express small 
peptides, entb» opecoos, vacdnes fiiat require oHgomecic protems witti staUe disulfide bridges 
and moiiockmals that require a8senib()f of heavy^g^ 

'10 In plant and animal cells, miclearmRNAs axe translated mon^ 

sodous problem when engineering multiple genes in plants (91). Thopefoie, in oxder to eaq>ress 
the p(^hydroxybutyrate polymer or Guy^s 13 antibody, single genes were first mtroduced mto 
individaal transg^c plants, then these plants were back-crossed to reconstitute the entire 
pathway or the complete protein (92,93). Similarly, in a seven year long effort, Yc ct aL (81) 

15 recently introduced a set of dnec genes for a short biosynthetic pathway that resulted in P- 
carotene expression in rice. In contrast, most chloroplast gmes of higiher plants are cotranscribed 
(91). Bxpressbn of polycisttons via the chloroplast g^iome provides a unique c^jportumty to 
ejqoess entire pa&swdys in a smgle transformation event The BactUus thmingiettsis (Bt) 
cr><2Aa2 operon has recertfly bemused as a model system to demonstrate opmn ei]q)ie8sion and 

20 crystal finmation via Ate chlGrq>]ast genome (3). CrylAs^l is tiie distal gene of a Hxree-gene 
operon. The orf immediately upstream of co^Aa2 codes &r a putative chiqperonin that fecilitates 
the folding of cryZAaZ (and other |Hx>tems) to fbnn proteolytically stable cuboidal crystals (94). 

Therefore, the cry2Aa2 bacterial operon was expressed in tobacco chloroplasts to test the 
resultant transgenic plants for increased expression and ino^xroved persistence of the accumulated 

25 insectiddal protein(s). Stable foreiga gene integration was confirmed by PGR and Southern blot 
analysis in To and Ti transgenic plants. Qy2Aa2 openm derived protein accumulated at 453% of 
tiie total soluble protein in mature leaves and remained stable, even in old bleached leaves 
(46Ll%)(Figqre 1). This is tiie hig^iest level of foreign gene esqaiession ever report»i in 
transgenic plants. Bxceedipgly difiScult to control insects (lOnlay old cotton bcdhvorm, 

30 beetanny worm) were killed. 100% afier consmniog transgenic leaves. Electron mlcrogiap hs 
showed die presence of the insecticidal protein folded into cuboidal crystals similai in sh^ to 
Cry2Aa2 oystals observed in Bacillus tkuringiensis (Figure 2). In contrast to currently mftTyfitp^ 
transgenic plants witiki soluble CRY proteins, folded protoxin crystals will be processed only by 
target msects that have alkaline gut pH; this approach should irqprove sai^ of Bt transgenic 

35 plants. Absence of insecticidal proteins in transgenic pollen eliminates toxicity to non-target 
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UeD. Li addition to these environinentally Mendly ^proaches, this observatum 
shraild serve as a model system for laige-scale production of foieigD proteins within chloroplasts 
in a folded configoratiGn enhancing their stability and facilitating single step pnxificaticm. This is 
the first demonstratic»i of e^qnession of a bacterial opmn in transgenic plants- and opens the 
5 door to engineo: novel pathways in plants in a single transformation event 

It is common knowledge that the medical conummity has been fighting a vigorous battle 
against drug resistant pathogenic bacteria fiir years. Cationic antibacterial peptides fiom 
mammal ^ a mphibi ans and insects have gained more attention over the last decade (95). Key 
features of these cationic pq^tides aie a net positive charge^ an a£GDEiity far negativefy-^bazged 

10 prokaryotic membiaxie phoqxholqxids ove^ 

to form aggregates that disri^t the bacterial inembrane (96). 

There are tiuree major peptides witii a-belical structures, cecropin fiom Ifyalcphora 
cecropia (giant silk moth), magainins from Xenopus laevis (AMcan frog) and def^i^ns from 
m a mm a li a n neutrophils. Magainin and its analogues have been studied as a broad->spectmm 

15 topical agent, a systemic antibiotic; a wound-healing stimulant; and an anticancer agent (97). We 
have rccendy observed dial a synthetic lytic peptide (MSI-99, 22 amino acids) can be 
successfully expressed in tobacco chloroplast (98). The peptide retained its lytic activity against 
die phytopathogenic bacteda Pseudomonas syHngae and multidrng resistant human pathogen, 
Pseudomonas aerugmosa. The anti-nucrobial pqptide (AMP) used in tiiis study was an 

20 ampfaipatiuc a]ptha4idiz molecule that has an affini^ for negatively charged phospholipids 
commonly found in tlie outerHnaabiane of bacteria. Upon contact with these membranes, 
individual pcp&dcs aggregate to form pores in the monbrane, resulting in bacterial lysis. 
Because of the concentration dependent action of the AMP, it was expressed via tiie chloroplast 
genome to acconqsHshhi^ dose delivery at the point of infection. PGR products and Soutiiem 

^ blots confirmed chlinroplast integration of the foreign genes and homoplaamy. Growth and 
developmeol of the tnmsgraic plants was imaffrffted by hyper-ez]H^ssion of the AMP within 
cfalocoplasts. In vitro assays with To and Ti plants confirmed tiuit tiie AMP was expressed at 
high levds C^l.5 to 43% of the total soluble protem) and retained biological activity agamst 
Pseudomonas syrir^ae, a major plant pathogen. In situ assays resulted in intense areas of 

30 necrosis around the point of infoctian in control leaves, \riiile transf or med leaves showed no 
signs of necrosis ^00-800 ^g of AMP at die site of mfecdon)(Pigure 3). T] in vitro assays 
against Pseudomonas aeruginosa (a multi-drug resistant human pathograi) displayed a 96% 
inhibition of growth (Figure 4). These results give a new option in the battle against 
{diytopathogenic and dnig-resistant himaan patfai)^^ Small peptides (like insulin) aie 
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tost oiS9]usm& However, stability of ttiis AMP in cUcxroplasts opens iq> lius 
CQiiq>artmeQt for eaqxression of honnomes and other small pqytides. 

Expression of cholera toxin p subunit oligomers as a vaccine in chlon^lasts 
5 Vibrio cholerae, whicli causes acute wateiy dianfiea by colonizing the snmll intestine and 

producing the enterotoxin^ cholera toxin (CT). Cholera toxin is a hexameric AB5 protein 
ccNisisting of one toxic 27kDa A subunit having ADP libosyl transferase activity and a ncxitoxic 
pentamer of II .6 kDa B sobunits (CTB) that binds to ttxe A subunit and faciKtates hs raliy into 
the intestinal qnfiielial cells. CHB ^Ktoi adiiiiniste^ 

10 which can neutralize the taxidty of the CT holotoxin by preventmg it fiom binding to the 
intestinal cells (100). Tins is believed to be a result of it binding to eukaryodc cell sor&ces via 
die G|^] g9nglioside8» recqytois presmit on die intestinal epithelial surfece, dnis eliciting a 
mucosal immune reqxmse to patfao^ms (101) and wijumr ltig tiie iimnane response when 
chemically c(nq>Ied to other antigens (102-105). 

15 Cholera toxin (CTB) has previously been expressed in nuclear transgenic plants at levels 

of 0.01 (leaves) to 03% (tubers) of the total sohible proteiiL To increase expression levels, we 
engmeered the chlon^last genome to express the CTB gene (10). We observed expre^on of 
oligomeric CTB at levels of 4-5% of total soluble plant protein (Figure 5 A). PGR and Soudiem 
Blot analyses confiimed stable integration of die CTB gene into die chkioplast gpnoma Western 

20 blot analysis showed that transgenic chlorpplast e^qiressed CTB was antigenically identical to 
commeicialty available purified CTB antigen (Rgure 6). Also, G]vix~S^S^^^ binding assays 
confimi diat ddoroplast syndiesized CTB binds to the intestinal memibrane receptor of cholera 
toxhi (Rgure 5B). Tran^gedic tobacco plants were morphologically indistinguisbable fioom 
untransformed plants and die introduced gene was found in be riahly inherited m tfte snbgeqneiit 

25 goaeration as confitmed by PCR and Southern Blot analyses. Hie increased ]nt)duction of an 
efBcient tiansmucosal carrkr molecule and delivery syst^tn, like CTB, in diloroplasts of plants 
makes plant based oral vaccines and fiision proteins with CTB needing oral administration, a 
much more feasible approach. This also establishes uneqmvocaUy that '^^oroplasts are capable 
of forming disulfide bridges to assemble fbrciga proteins. 

30 

Expression and assendily of monodonals In transgenic chloroplasts 

Dental caries (cavities) is probably die most jMnevalent disease of HiimanVinf! 
Colonization of tee& by £ msi/any is die sii^e most in^>ortant risk factor 
dental caries. iSl mutans is a non-^notfle, gram positi:ve coccus, ft cokmizes tooth sur&ces and 
35 synthesizes Queans (insoluble polysacchr^'*'*'^ foictans fiom sucrose using the enzymes 
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rase and finctosyitransferase lespectively (106aX The gbcans play an important 
role by allowing the bacteriuni to adhere to the smooth tooth sur&^ After its adherence, the 
bactermm ferm^ts sucrose and produces lactic acid Lactic acid dissolves ^e minerals of the 
tooth, producing a cavity. 

5 A topical monoclonal antibody fhexBpy to prevent adherence of S, mutans to teeth has 

recent^ been developed. The incidence of caiiogenic bacteria (in hnmans and animals) and 
dental caries (in animals) was dramatically reduced fixr periods of iqp to two years after the 
cessattcm of the antibody diec^. No advose emits were detected ed 

or in human vohmteers QOGt^ The annual requirement for this antibody in the US alone may 
10 eventually exceed I metric ton. TherefoEr^ this antibody was eoqiEessed via the chloioplast 
genome to achieve higher levds of esqpiession and proper folding (11). The integration of 
antibody genes into to cfalaroplast genome was confirmed by PGR and Southern blot analysis. 
The expiession of both heavy and lig^ chains was confimied by western blot analysis nnder 
redndng conditions (Figure 7A3X The expression of fiiUy assembled antibody was confirmed 
15 by western blot analysis under noo-ieduciag conditions OFigure 7CX This is the first rq>ort of 
successful assembly of a muM-subunit human protein in transgenic cfalorqplasts. l^ooduction of 
moiKxdonal antibodies at agdcuhuial level should reduce tibeir cost and create new aj^cations 
of manoclonal antibodies. 

miMAN SERUM ALBIJMIN 

- 20 Nnclear transformation 

The human HSA cDKA was cloned fiom human liver cells and the patatin promoter 
(^ose esqnession is tuber specific (107)) fuswl along with the leader sequence of PIN U 
(proteinase n inhibitor potato transit peptide that directs HSA to the apoplast (108)). Leaf discs 
of Desiree and Ki^mefoec potato plants were transformed mmgAgrobacterium tmnefaaens. A 

25 total of 98 txansg^iic Desiree dones and 30 Kennebec clones w 

blots. Western blots showed that the recombinant albumin (cHSA) had been properly dieaved by 
the psotdnase n inhibitor transit peptide (Figure 8). Expression levels of both cnbivars were very 
different amc»ig all transgenic clones as expected (Figure 9), probably because of positicm effects 
and gene silencing (89,90)l The population distribution was similar in bofiicnltivais: majority of 

30 transgenic clones showed ciqnession levek between 0.04 and 0.06% of dlSA in tiic total soluble 
protein. The maximum recombinant HSA amount e^qiressed was 0.2%. Between one and five T- 
DNA insertions p«- tetiaploid genome were observed in these clones. Plants with higher protein 
exfH^sion were ahvays clones with several copies of the HSA gene. Levels of mRNA were 
analyzed by N(Mlhem blots. There was a correlation between transoipt levels and recombinant 

35 albumin accumulation in tcansgeoic tubers. The N-terminal sequence showed proper cleavage of 
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^ tide and the amino terminal sequence between leoomibinant and human EGSA was 
identicaL InhibitiQn of patatin esquession nsing tiie antiseiise tedmology did not inqnove the 
amount of iHSA^ Average expression level among 29 transgenic plants was 0X3(32% of total 
soluble protdn, wi& a Tnaximiim e3q>ressian of 0.1%. 
5 Transformation of the tobacco chloioplast genome was initiated for hypere^qnressiQn of 

HSA. The codon compositicMi is ideal for chloioplast expies^on and no changes in nucleotide 
sequences were necessaiy. For all the constracts pLD vector was used. Several vectors were 
deigned to optimize HSA oqpressioa All diese contained ATG as Hts first amino acid of the 
mature patoti^n. 
10 RBS-ATGaSA 

The first vector included the gene that codes for the mature HSA phis an additi(»ial ATG 
as a translation initiation codon. We indoded the ATG in one of the primers of Ae PGR, 5 
nucleotides downstream of the ddcnroplast preferred RBS sequence GGAGG, The cDNA 
sequence of the mature HSA (cloned in Dr. Mingo-Casters kboratory) was used as a tcnq>late. 
15 The PGR iHxxhict was cloned into PGR 2.1 vectc^, excised as an EcoRI-NotE fiagmcnt and 
introduced into the pLD vector. 
5*Un^bA-AT&HSA 

The 200 bp tobacco ddoroplast DNA fragment containing the 5* psbA UTR was 
anq>lified nsng PGR and tobacco DNA as template. The fragment was cloned into PGR 2.1 
20 vector, excised EcoRI4icoI fiagmoit was inserted at the Ncol site of the ATG-HS A and finally 
inserted into the pli> vector as an EcoRI<-Nod fragment downstream of the 16S £^ 
, to enhance translation of the protein. 
BtORFl+^ATG-HSA 

ORFl and OIUE^ of the Bt Cry24a2 opmn were an]^Iified in a PC^ 
25 opmm as a tenq>late. The fragment was cloned into PGR Z I vector^ excised as an BcoRI-BcoRV 
fragment, inserted at EcoRV site with the ATCr-HSA sequence and introduced into the pLD 
vector as an EcoRI-NotI fragment The ORFl and 0RF2 were frisediqistream of ^ ATG-HSA. 

Because of the simikrily of protein synthetic machinny (109X expressicxi of all 
chloroplast vectors was first tested in E.coU before their use m tobacco transfosmatioiL DifTecent 
30 levels of eaqiression were obtained in iSl cct/i dq^ending on the construct CKgure 10). Usmg the 
psbA 5' UIR and die ORFl and ORF2 of the cry2Aa2 operon, we obtained hi^ier levds of 
expression than using only d» RBS. We have observed in previous experiments that HSA in K 
coii is completely insoluble (as is shown in ref 14), probably due to an improper folding resulting 
from the absence of disulfide bonds. This is tht reason why the protein is precipitated in ^ gel 
(Figure 10). DifTereait polypeptide sizes were obs^ed, probably due to inconoqplete fTffnslfttion 
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K coU and chl<Hop]ast have siinilarpiotem synthesis machmecy, one could e^qiect 
different levels of expressLon in transgenic tobacco chloroplasts d^>eQding on the regolatoiy 
sequences, with the advantage that disulfide bcmds are formed in chloioplasts (9). These Aree 
vectors were bombarded into tobacco leaves via particle bombardment (110) and after 4 weeks 
5 small shoots q)peared as a result of independent transformation events. 

1NTERFERON-D5 

Inteifeon-D5 has not been e^qxressed as a c<»nmercial recombmant protein. Thefibrst 
attenqit has been made recentfy. The IFN-OS giene was cloned and tiie sequoice of the mature 
protein was inserted into the pBT28 vector, ftiaX inchided the ATCs ^»^'<<tnft tag for pmificatton 
and thrombin cleavagp sequencesL The tagged IFN-DS was purified first by bhiding to a nlc&el 
column and biotinylated thrDmhin was then used to eliminate ^ tag on IFN-DD. Biotinylated 
thrombin was removed fitmi the preparation using streptavidin agarose. The ^q)ression level was 
5.6 micrograms p^ liter of broth culture and the recombinant protein was active in antiviral 
activity similar or higjbter than commercial IFN-D2 (hitron A, Schedng Plouth). 

Insnlin-libe Growth Factorrl flCT-D 

Recent studies have demonstrated that tneatment with low doses of IdP-I induced 
significant improvements in imlritional status (52^ intestinal absoiption (53-SSX osteopenia (56), 
20 hypogonadism (57) and liver fimction (58) in rats with experimental liver drchosis. These data 
siqpport that IGF-I deficiency plays a padiogenic role in several systemic complications occoning 
in liver drdiosis. Qinical studies are in progress to ascotain die role of IGF-I in the 
management of ciidiotic patients. Unpublished studies indicate ^lat IGF-I could also be used in 
patients with articular degenerative disease (osteoarduitis). 

25 

Bxperirp g»fa^ 
Example 1 

Evaluation of chloroplast gene e:qiression 

A systematic i^pproach is used to identify and overcome potential' limttatioas of fixrdgn 

30 gene expression in chloroplasts of transg^e plants. This esapmsossA increases the utility of 
diloiK^last transfoimation system by scientists in t e rested in e3q>re8sing other foreign proteins. 
Therefore, it is in^xjrtant to systematically analyze transcription, RNA abundance, RNA 
stability, rate of protein synthesis and degradation, proper folding and biological activity. The 
rate of transcription of the introduced HSA gene is conqiared with the hig^y expressing 

35 ^ endogenous chloroplast genes (rfocl^ psbA» 16S iKNA), using run on transcription assays to 
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}e 16SrRNA pn^note is operating as expected. The transcription eE^ency of 
transgenic diloroplast containing each of die three constructs with different 5' regions is tested. 
Similarly, transgene RKA levels are monitored by northerns^ dot blots and primer extension 
relative to endograioos rbcl^ 16S rRNA or psbA. These results,* along with run on transcription 
5 assays, provide vahabk mformation of RNA stability, processing, etc. RNA appears to be 
extremely stable based an noitfaem blot analysis. This systematic study is valuable to advance 
utility of this system by odier sdeudsts. Most importantly, the efGciency of translation is tested 
in isolated chloioplasts and conqpared witii &e hi^y translated chloroplast protein 0>sbA). 
Pulse chase esqiennients help assess if traoslatioDal pausing^ premature teminatifw occurs. 

10 Evaluation of percaat RNA loaded on polysomes or in constnsctswidi or wi 

to determine the efficiency of the nbosomB binding site and 5' stem-loop trai»lational enhancers; 
Codon <qptimized genes (lOF-I, IFN) are compared with unmodified genes to investigate the rate 
of translation^ pausing and t^minatioa A 200-fbld difference in accumulation of fbrdgn 
proteins due to decreases in proteolysis conferred by a putative chaperonin P) was d>servcd 

15 ^ Therefore^ proteins fiom canstructs esqxressing or not expressing die putative di^)eronin (with or 
without ORP142) provide valuable infixrmation on protein stability. 

Example 2 

Egression of the mature protein 

20 HSA» Interferon and IG^-I are pre-pioteins diat need to be cleaved to secr^ mature 

proteins. The codon for translation initiation is in the presequence. In chloroplasts, the necessity 
of expressing die mature protein forces introduction of this additional amino ac^d in coding 
sequences. In order to optimize esqsression levels^ we first subcloaie the sequence of the mature 
proteins beginning with an ATG. Subsequent immunological assays in mice demonstrates the 

25 extra-methionme causes immunogenic response and low bioactivify. Altematively, systems may 
also produce the mature proteixL These systems can include the synthesis of a protein fused to a 
peptide that is cleaved intracellnlary processed) by chlorpplast en^rnes or the use of chemical 
<nr enzymatic deavagp after partial purification of proteins firom plant cells. 

30 Use of peptides that are deaved in chloroplast 

Staub et aL (9) reported chloroplast ejqnression of human somatotropin snnilar to the 
native human protein by using ubiquitin fusions that were cleaved in the stroma by an ubiquitin 
protease. However, the processing efficiency ranged fiom 30-80% and the cleavage site was not 
accurate. In* order to process chloroplast expressed proteins a pq>tide whidi is cleaved in tiie 
35 stroma is esseotiaL The transit pq)tide sequence of the RuBisCo (ribulose l>-bisphosphate 
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mall subunit is an ideal choice. This transit pqytide has been studied in depth 
(111). RuBisCo is one of the proteins that is synthesized in cytoplasm and tianqxxrted 
postranslatioiuiUy into the chloioplast in an energy depradent process. The transit pq)tide is 
proteolytically removed vpm transport in the stroma by the stromal processing pqitidase (112). 
5 There are several sequences descnhed for different ^)ecies (113). A transit peptide consensus 
sequence fertile RuBisCo small sobunit of vascular plants is published by Keeg3tra et al. (114). 
The amino adds that are proximal to the C-tenninal (41-59) aie bi^ofy conserved in the higher 
plant traxisit sequences and belong to the domain is in^^ 

The RuBisCo small snbunit transit peptide has been &sed mSh vanoos marker jxroteins 
10 (114>11SX even wiHi animal ^otesns (116,117), to target proteins to the cbloroplast f^or to 
transformation studies, the cleavage efficiency and accuracy are tested by in vitro translation of 
the fusion proteins and in orgtmetto inqx>rt studies using intact chloroplasts. Thereait^, 
knowing the correct fusion sequence for producing the mature protein, such sequence encoding 
the amino terminal portion of tobacco cbloroplast transit peptide is linked witii the mature 
15 sequence of each proteiiL Codon conqiosition of the tobacco RuBisCo small subunit transit 
peptide is oonqKitible with dilorqplast optinud translation (see section d3 and table 1 on page 
30). Additional transit peptide sequences for targeting and cleavage in the chknoplast have been 
desoibed (111). The hnnen of tl^akoids could also be a good target because thylakoids are 
readly purified. T Jimmal proteins can be freed dtfaer by sonicaticMi or with a very tow triton 
20 XlOO concentration, althou^ this requiies insertion of additional, amino acid secpences for 
effk^ent insaposX (1 1 1). 

Example 3 

Use of chemical or enzymatie deav^e 
25 Hie stiategy of fusing a protein to a tag with afSnity fix a certain lig^ has been used 

' extensively for more tiian a decade to enable afGnitypouification of recambinaiit products (118- 

120). A vast number of deavagis methods, hoUi dhemical and enzymatic, have been investigated 

for this purpose ( 120). Chemical cleavage methods have low specificity and tiie relatively harsh 

deavage conditions can icasult in Ghcmical modificatioxis of the released products (120). Some of 
30 tiic enacymatic metiK>ds ofiTer significantly higher cleavage spedfidties togsther with high 

efBcieocy, e. g. H64A aditilisin, IgA protease and fector Xa (119,120), but these enzymes have 

the drawback of being quite e3q)ensive. 

Trypsin, which cleaves Otennmal of basic amino-add residues, has been used for a long 

time to deave fusion proteins (14,121). Despite e^^ected low specificity, trypsin has been ^wn 
35 to be nsefiil for specific cleavage effusion proteins, leamg basic residues widun folded proteni 
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avaged (121). The use of trypsin only lequiies that the N-tenninus of the mature 
protoin be accesable to the protease and tiliat the potential intemai sites are protected in &6 
native conformation. Trypsin has the additional advantage of being ine?q)eQsive and readily 
available. In the case of HSA, when it was expressed in K coli with 6 additional codons coding 
5 for a trypsin cleavage site, HSA was processed successfully into the mature protein after 
treatment with the protease. In addition, tiie N-terminal sequence was found to be unique and 
identical to the sequence of natural HSA, tiie conversion was complete and no degradation ' 
products were observed (14). This in vUro maturatiGa is selective because cokrecOy folded 
albumm is hi^^iesistant to tiypsin cleavage at inner sites (14). This sjystem could be tested for 

10 chloroplasts HSA vectoiB using pn>te]ne7spre88edia£co^^ 

Staub et aL (9) demonstrated that the chloroplast methionine aminopeptidase h active and 
they found 95% of removal of the first methionine of an ATG-scmatotropin piotein tiiat was 
e^qsressed via the chloroplast genome* There are several investig^tions that have shown a very 
strict pattern of cleavage by this peptidase (122). Mediionine is only rranoved when second 

15 residues are gl^cine^ alamne^ s^dn^ cysteine, threonine^ proline or valine but if .the third amino 
acid is proline the cleavage is inhibited. In the e;qxression of our three protrans we use this 
approacii to obtain foe mature protein in the case of foterferon because foe penultimate 
aminoacid is cystein followed by aspaitic add. For HSA the second aminoacid is aspaxtic acid 
and for IGF-I g^lydne but it is followed by proline, so the cleavage is not dependable. 

20 For IGF-I e^qpression, the use of foe TEV protease (Gfoco cat n 10127-017) would be 

ideal The cleavage site that is recognized for this protease is Glit-Asn-LeurTyr-Phe-Ghi-Gly and 
it cuts betwe^ Gln-Gly. This strategy allows foe release of the mature protein by incubation 
with TEV protease leaving a glycine as foe first amino acid consistent wifo biiim r^ mature IGF-I 
protein. 

25 The punficationfiystem of foe £1 a)il LiiteifoEon-DS eiqiressionmefood was 

Histidine-tags that bind to a iddDBl cotumn and biotinyiated thrombin to ftltminate tbe tag on 
IFN-05. Tfaimifom recognizes Leu-VakPio-Arg-Giy-Ser and cots between Aig and Oly. This 
leaves two' cxlia amino acids in foe mature protein, but antrviial activity studies have shown foat 
this ptotdn is at least as active as commeicaal IFN-Q2. 

30 

Example 4 

Optfodzation of gene expression 

Foreign genes are expressed between 3% {oyMai) and 47% {py2Aa2 operon) in 
transgenic chlon^lasts (3,85). Based on foe outcome of the evaluation of HSA chloroplast 
35 transgenic plants, several approaches can be used to enhance translation of the recombhiant 
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I(myp]a5ls» transcriptiond regokti on of g^e 
some modulations by light and developmraital conditions are observed (123). KNA stability 
appeals to be one among the least problmis because of observation of excessive accumulation of 
f(»:eign transcripts, at times 16,966-fold higher ifaan the highly expressing nuclear transgenic 
5 plants (124). Chloroplast gene expression is regulated to a large extent at the post-transcr^tianal 
level For exan^le^ 5* UTRs are necessary for optimal translation of chloroplast mRNAs. Shine- 
Dalg^tno (GGAGG) seqaeoces^ as well as a stem-loop strocture located 5* adjacent to the SD 
sequence^ are reqinred for efficient translation A recent study has shown fltatinsertiofn of the 
psbA 5* UIR dowBstieam of the 16S iRNA promoter enhanced translation of a fivetgn gene 

10 (GUS) hondied-fbld (12Sa). Thacefoie* Ibe 200-bp tobacco cblorqda^ 

14S0) containing 5* psbA UTR should be used. This PGR product is inserted downstieam of the 
168 iRNA promoter to enhance translation of the recombinant proteins. 

Yet anoth^ ^[)proach for eohanc^ent of translation is to optknize codon con^si 
Since all tiic three proteins are translated in K coli (see section h\ it would be reasonable to 

15 expect e£5ciCTt e^zres^on in chlozoplasts. However, optimizang codon ca^^x)sitions to match 
the psbA gene codd fintfacr enhance the levd of transbtion. Although ibcL (RoBisCX)) is the 
most abundant protein on eardi, it is not translated as as the psbA g^e due to the 

extremely Mg^ turnover of the psbA g^ product Hie psbA gene is under stroi^er selection 
for increased tmnslation efficiency and is the most abundant tiiylakoid protein. In addition^ the 

20 codon usage in hig^ plant chloroplasts is biased towards the KNC oodon of 2-fold degenerate 
groups (Le. TTC over TTT, OAC over GAT, CAC over CAT, AAC over AAT, ATC over ATT, 
ATA etc.). This is in addition to a strong bias towards T at tiiiid position of 4-fold de^oate 
groq[>& There is also a context effect that sho^d be taken into consideration wfail 
specific codons. The 2-lbld degenerate sites immediately upstream from a GNN codon do not 

25 show this bias towanlsNNCCirrGGA is preferred 

to TIT CGT, TTC ACT to TTT ACT and TTC TCT to TTT TCTXl25b,126). hx addition, 
hig^y expressed chknoplast govs use GNN more fiequently that other genes. Codon 
composition was cfxtunized by comparing different species. Abundance of amino adds in 
ddoroplasts and fRNA anticodons present in chloipplast nmst be taken into cansidBratian. We 

30 also compared A+T% content of all fixrejgn genes ttiat had been eaqircssed in transgenic 
chloroplasts in our laboratory with tiie percentage of chloiopla^ expression. We £xmd that 
hitler levels of A+T always correlated with hi^ expressicm levels (see table 1). It is also 
possible to modify chloroplast protease recognition sites while modifying codons, without 
afifecting &eir biological functions. 

35 Ibe study of the sequoices of HSA, and Interferon-QS was done. Ibe HSA 
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ed 57% of A+T content and 40% of total codons matched with the psbA most 
translated codons. According to the data of table 1» we expected good cfaloroplast expression of 
the HSA gpne without any modifications in its codon composition. IFN-D5 has 54% of A+T 
content and 40% of matching witii psbA codons. The conq)ositioii seems to be good but this 
5 protein is small (166 amino acids) and the sequence was optimized to achieve A+T levels close 
to 65%. Finally, the analysis of the IGF-I sequence showed that the A+T content was 40% and 
only 20% of the codons are the most translated in psbA« Therefore, this giene needed to be 
optinuzed. Opttmizatiini of Uiese two g^es is done using a novel PGR i^ipEoacfa (127»128) 
has been sucoessfiilly used to optimize codon conqK>sition of other fanmaa proteins. 

10 

Example 5 

Vector constructions 

For aQ the constructs pLD vector is used. This vector was developed in this laboratory for 
chloroplast transformation. It contains die 16S rRNA promoter (Ptm) driving die selectable 

15 madcer gene aadA (aminoglycoside adenyl tzansferase conferring resistance to specdnomycin) 
followed by tl» psbA 3* region (the terminator fipom a gene coding for phc^system II reaction 
center oonqx>nents) from the tobacco chloroplast gename. Hie pLD vector is a imiveisal 
cblofoplast expression /integration vector and can be used to transform chlotoplast genomes of 
sevacal odier ptam qiedes (73,86) becaiise these flank±3g seq^ 

20 hig^ plants. The universal vector uses tmA and imi genes (chloroplast trans&r RNAs coding 
for Alanine and Isolevcine) fiom the inverted repeat region of the tobacco chloroplast genome as 
flanfcing sequences for homologous recombination. Because tiie universal vector integrates 
foreign g^nes within die Inverted Repeat regioa of the chloroplast genome^ it should double the 
cc^y immber of the transgjene (horn 5000 to 10,000 copies per cell in tobacco). Furthermore^ it 

25 has been demonstrated that homoplasmy is achieved even in the first round of selection in 
tobacco probably because of the presence of a cddoroplast origm of rq>lication wittiin die 
flanldng sequence in fte univeisal vector (tbaAy providhig mm templates for integration). 
Because of these and several other reason^ foreign gene expression vras shown to be much 
faigber when die universal vector was used instead of the tobacco 

30 

The following vectora are used to optimize protein expression, purification and prodnctum of 
proteins with the same amino acid composition as in hurwan proteins. 

a) In order to opthnize e3q>ression, translation is increased using die psbA 5*Urni and 
35 optimizing the codon conq>osition for protein e]q>ression in chloroplasts according to cnteiia 
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sreviousty. The 200 bp tobacco chloroplast DHA fragment containing psbA 
UTR is anq)lificd by PGR using tctecco chloroplast DNA as template. This fiagmwit is 
ckmed direcdy in the pLD vector multiple cloning site (EcoRI-NcoI) downstream of the 
promoter and the aadA gene. The cloned sequence is exactly the same asin thepsbA gene. 

5 

b) For enhancing protein stability am) fecilitating purification, the cry2Aa2 Bacillus 
thuringlensis openm derived putative chaperonin is used Expression offt&'ay2Aa2 opeion 
in dilotoplasts jnxmdes a model system fat hyper-expressi(»i of fiardgn pioteins (46% of 
total sohible protein) hi a folded configuration enhancing their stability and feriiitftting 

10 purification (3). This justifies inclusion of the putative cb^ieronin fioxn the ay2Aa2 opeton 
in C3ie of the newly designed constnicts. In this region there are two open reading fismes 
(ORPl and ORF2) aitd a ribosomal buidmg site (rbs). This sequence contains elements 
necessaiy for CrylAal crystallization which help to crystallize die HSA, IGP-I and IFN-0 
proteins aiding in the subsequent purification. Succcssfiil crystallization of other proteins 

15 using this putative ch£4)eromn has beoi demonstrated C^). We am^ aadORF2 
of die Bt Cry2Aa2 operon by PGR using the complete operon as tenqthte. The fiagment is 
cloned into a rcR 2. 1 vector and excised as an EcoKI-EcdRV ]noduc^ 
cloned directly into the pLD vector mult^le cloning site (EcoRI-EcoRV) downsfateam of the 
promoter and die aadA gene. 

20 

c) To obtain proteins widi the same amino acid composition as mature human proteins^ we first 
fuse all three goies (cockm optunized and native sequence) with die RuBisCo small subunit 
transit peptide. Also other constructions are done to allow cleavage of the protein after 
isolation from chtproplast These stmtegiea also allow affiniiy pirifj^fiq of fte proteins. 

25 

The first set of oanstmcts includes the sequence of each ]mtein beginning with an ATG, 
introduced by PGR using primers. Processing to get the ixiature protein niay be performed where 
the ATO is shown to be a problem (detramined by mice immunological assays). First, we use 
the RuBisCo small siibnnit transit peptide. This transit p^de is amplified by PGR using 

30 tobacco DNA as tcanplatc and cloned mto the PGR 2.1 vector. All genes are fused widi the 
transit peptide using a Mlul restriction site diat is introduced in tiie PGR primers fixr 
amplification of the transit pq>tide and genes coding for three proteins. The ^e fissions are 
msmed mto the pID vect(»rs downstream of the 5'UTR or ORFl+2 nging the restriction sites 
Ncol and EcoRV re^iectiv^. If use of tags or protease seqo^ices is necessary, such sequences 

35 can be introduced by designing primers mcluding these sequences and amplifying the gene with 
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mpleting vector constructioi^ all the vectors are sequenced to conSnn correct 
nucleotide sequence and in frame fusion. DNA sequencing is done using a Peddn Efaner ABI 
pdsm 373 DNA sequencing system. 

Because of &e siinilarity of protein synthetic machinery (109), eocpiession of all 
5 chloioplast vectors is first tested in E^li before their use in tobacco transfonnation. For 
Escherichia coH esqnession XL-l Blue strain is used K coli can be liansfoimed by standard 
CaClj transfonnation procedures and grown in TB culture media. Puziftcaticn, biological and 
inomunogCTic assays axe done using JBl coi^* ecqxressed pro^^ 

10 Example 6 

Bombardment, Regeneration and Characterization of Chloroplast IVansgmic Plants 

Tobacco {Nicotiana tabacum var. Petit Havana) plants are grown asepticaUy by 
gemination of seeds on MSO mediunL Tins medium contains MS salts (43 g/liter), 35 vitamin 
mixture (nryo-iru>sitol, 100 m^ter; tiuaminD-HCl, 10 mg/Uter; nicotinic acid» 1 mg/Hter; 

15 pyridoxine-HG» I mg/Iiter)» sucrose (30 g^ter) and phytagar (6 g/liter) at pH SA. Fully 
e7q)anded» dark green leaves of dxiut two nionUi old plants are used fixr bombardment. 

Leaves axe placed abaxial side 1^ on a Whatman No. 1 filter paper laying on the RMOP 
medium (79) in standard petii plates (100x15 mm) for bombardment Gold (0.6 |im) 
microprDjectUes are coated with plasndd DNA. (chloroplast vectors) and bombardmenis are 

20 caoied out with tfae biolistic device FDSIOOO/He (Bio-Rad) as described by Daniell (110). 
Following bombardment, petri plates are sealed with parafilm and incubated at 24**C under 12 h 
phot(^)eriod. Two days after bombardment, leaves are chc^kped into small pieces of ~5 mm^ in 
size and placed cm the selection medium (RMOP containing 500 ^g^ of spectinomydn 
dibydroddodde) with abaxial side touching die medium in deq> (100x25 nnn) petri plates ('^lO 

25 pieces per plate). The regenerated specdnomycin resistant shoots are chopped into small pieces 
('^2mm^ and subcloned into fiesh deq^ petri plates (-^5 pieces per plate) containing the same 
selection mediimi. Resistant sihoots from the second culture cycle are then transifoied to the 
rooting medium (MSO medimn supplemented with IBA^ 1 xng/liter and flpectinomycin 
dihydrochloride^500mg^ter). Rooted plaiits are transferred to soil and grown at 26*'C under 16 

3Q hour pthotoperiod conditioiis for fortber analysis. 

PGR analysis of putative transformants 

PGR is done using DNA isolated ftom control and transgenic plants in order to 
distinguish a) true chloroplast transformants fiam mutants and b) c^oroplast transfbrmants fiom 
35 nudear transfonnants. Prim^ for testing tfae presmce of tfae aadA gepe (that confers 
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lesistance) in txansgemc plants aie landed on ftie aadA coding sequence and 16S 
rRNA gene. In order to test chlon^last integration of tiie genes, one primer lands on aadA 
geFie while another lands an the native chloroplast genome. No PGR product is obtained wi& 
nuclear transgenic plants using this set of piimos. The primer set is used to test integration of the 
5 raitixe gene cassette witiiont any intRfna] ddetion or looping out fi ri n g homologous 
recombination. Similar strategy was used successfiiUy to con&m ddoitqplast integration of 
foreign genes (3,85-88). This screening is essential to eliminate mutants and nuclear 
transfiotmants. In order to conduct PGR anafyses in tiansgiBtuc plants^ total 0NA fiom 
nnbomibarded and tiansgBnic plants is isolated as described by Edwards et aL (129). Chloroplast 
10 transgenic plants containing the desired gene are flien moved to second round of selection in 
ord^to achieve hcnnoplasmy. 

Southern Analysis for homopiasmy and copy number 

Soudiran blots are ckme to determine the copy numbo' of ^xs introduced foreign ^aas per 

15 cell as well as to test homoplasmy. There arc several thmisand copies of the ddoroplast genome 
present in eadi plant celL Therefore, when foreign genes are inserted into the chloroplast 
B BT in me ^ some of fee cbkxofkist genomes have foreign genes integrated while others remain as . 
the wild type (heteroplasmy). Therefore, in order tor ensure that only the transformed gmome 
exists in cells of tcan^eiuc plants (homcplasmy), the selection process is continued. In order to 

20 confinn that the wild type gemnne does not exist at the end of the selection cycle, total DNA 
from transonic plants are probed with the chloroplast border (flanking) sequences (the tml-tmA 
fragment). When wild type genomes are present (heteroplasmyX the native fragment size is 
observed along with transformed genomes. Presence of a large fragment (due to insertion of 
foreign gjenes within the flanking sequoices) and absence of the native smaQ fragment confirms 

25 homoplasmy (85,86,88). 

The copy number of Ihe tnt^tated g^ is detmnined fay establishing homoplasniy for 
the transgenic chloroplast genome. Tobacco chlorqplasts contain 5000^10,000 copies of their 
genome per cdl (86). If only a fraction of the genomes are actually transformed, foe copy 
number, by de&ult, must be less foan 10,000. By establishing that in the transgenics the gene 

30 inserted transfinmed genome is the only one present^ one can establish that the copy number is 
5000-10,000 per cell This is usually done by digesting foe total DNA with a suitable restriction 
enzyme and probing with the flanking sequences foat enable homologous recombination into the 
chloroplast genome. The native fragment present in the control should be absent in the 
transgenics. The absence of native fi:agment proves that only the transgenic chloroplast gen(Hno 

35 is present in foe cell and foere is no native untransformed, ddovoplast gpnome, without foe 
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present Tbk establishes the homoplasmic nature of our tronsfixnnsnts^ 
sinuiltaiieously pnmdiiig us with an esthnale of 5000^10,000 copies of the fordgn genes per 
celL 

5 Northern Analysis for transact stability 

Nortiiem blots aie done to test Ihe efficiency of transci^ TotalRNAis 
isolated fiom 150 mg of fiozm leaves by usmg the '^lea^ Plant Total RNA Isolation Kit** 
((^agsn Iqc^ CSiatsworth, CA> RNA (10-40 pg) is denatmed by finmalddiyde treatment 
8q>arated on a agarose gel in the presence of formalddiyde and txanafened to a 
10 nitroceMttlose membrane (MSI) as described in Sambrook et aL (130), Ftobe DNA Qminsiilin 
gene coding region) is labeied by the landoniiyrim^ 

Tbe blot is pr&-hybridized, hybridized and washed as described above for southern blot analysisL 
Transccqpt levels are qoantified by the Molecular Analyst Program usixig the GS-700 imaging 
Densitometer (Bio-Rad, Hercules^ CA). 

15 

Expression and quantiflcation of the total protein expressed in chloroplast 

Qiloroplast esqiression assays are done for each protein by Western Blot Reoomblnant 
protein levels in transgenic plants are detennined nsing quantitatiye ELISA assays. A standard 
curve is generated using known ooooentrations and serial dihitions of recombinaiit and native 
20 proteins. Diffarent tissues are analyzed nsing young, mature and old leaves against these primary 
antibodies: goat anti-HSA (Nordic bmnunology), anti-IGF-I and anti-Ihterferon alpha (Sigma^ 
Bound IgG is measured nsing horseradish penmdase-labelled anti-goat IgQ. 

Inhoitance of Introduced For^;n Genes 

25 While it is unlikely ftat introduced DNA would move from Il» chloroplast gpnome to 

nuclear gen(»ne, it is possible that the gene could get integrated in the nuclear genome during 
bombardment and remain undetected in Soudiem analysis. Therefore, in initial tobacco 
transfonnants, some are allowed to self-pollinate, whereas others are used in reciprocal crosses 
witii control tobacco (tran^enics as female accepters and pollen donor^ testing for maternal 

30 inheritance). Harvested seeds (Tl) will be geminated on media containing spectinomydn. 
Achievement of homc^Iasmy and mode of inheritance can be classified by looking at 
germination results. Hnnopbisniy is indicated by totally green seedlings (86) ^Adleheteroplasmy 
is displayed by variegated leaves QBxk of p'gynentation, 83). Lack of variation in cfalarophyll 
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imong progeny also tmdoscores the absence of position efTect, an aiti&ct of 
nuclear transfonnation. Maternal inh^itaitce is be demonstrated by sole transmission of 
introduced genes via seed generated on transgenic plants, regaidless of poUen source (green 
seedlings on selective media). When transgenic pollen is used for pollination of control plants, 
5 resultant progeny do not contain resistance to chemical in selective media (will appear bleached; 
83X Molecular analyses confnn transmission and eg^^ ofintrDd0cedgenes^andT2seedis 
generated fipm those confirmed plants by the analyses described aibove. 

Bzamide? 
0 Pnrifieation methods 

The standard method of purification employs classical bioc^iemical techmques with the 
crystallized protdns inside tite chloroplast la this case, the homoger^tes aie passed tiirongb 
miracloth to remove cell debris. Centrifiigation at 10,000 xg pelletizes all foreign proteins (3). 
Proteins are sohibilized using pH, tcnqperature gradient, etc. This is possible if the ORFl and 2 
5 of the ay2Aa2 operon (see section c) can fold and crystallize the recombinant proteins as 
expected. Weietfiere is no crystal formatLon, other purificali(»iinettLodsm 
biodiemistcy techniqaes and afBiiity cohmms with jTO 

HSA: Albmmn is typically administered m tens of gram quantities. At a parity levd of 99.999% 
0 (a level considerBd soffident for odier recombinant protein prqMirationsX recombinant HSA 
(rHSA) in^urities on the order of one mg will still be injected into patients. So inqnmties &om 
the host (Hganism must be reduced to a minimimi. Fni&ennore, purified rHSA nmst be jdentical 
to human HSA. De^te ttiese stringent tequiiements, (HirificatiQn costs nmst be kept low. To 
purify the HSA obtained by gene manipulation^ it is not appropriate to ^ly conventional 
5 processes for purifying HSA originating in plasma as such. Uns is because the impurities to be 
dinimated from iHSA oon^>letely differ firom those coii^^ 

Namely, iHSA is contaminated witii, for exanq)le, coloring matters characteristic to recombinant 
HSA, protems originating in die host cdls, polysacchandes, etc. hi particular, it is necessary to 
sufiBcieaitiy eliminate oomponeDts originating in the host cells, since th^ are fixrdgn matters for 
D liviiig oigaiusmsiiichidiQg human and can cause the pn>bl6m of anl^^ 

In plants two different methods of HSA purification have been done at laboratory scale. 
Sijmons et al. (23) transformed potato and tobacco plants with Agmbacierium ttmrfaciens. For 
tire extraction and purification of HSA, 1000 g of stem and leaf tissue was homogenized in 1000 
5 ml cold PBS, 0.6% FVP, 0.1 mM PMSF and 1 mM EDTA The homogenate was clarified by 
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rifiiged and the supernatant incubated for 4 h with 1^ ml polyclonal antiHSA 
coi^Ied to Reactig^l spheres (Pierce Chem) in the presence of 0S% Tween 80. Tb& complex 
HSA^ti HSA-Reactigel was collected and washed with 5 ml 0.5% Tween 80 in PBS. HSA was 
desoEbed finom the reactigel complex widi 2.5 ml of 0.1 M glycine pH 2.5, 10% dioxane» 
5 Immediately followed by a buffer exchange with Sqphadex G25 to SO mM Tris pH 8. The 
saiiq>le was than loaded on a HR5/S MonoQ amon exchange column (Phannacia) and elotBd 
with a Imear Nad gradioit (0-350 niM NaQ m 50 mM Tris pH 8 in. 20 min at Iml/min). 
[Reactions containiDg tib concentxated HSA (at 

isai 1(V30 Sq^shaiose 6 cdnmn (Phamiacia) in PBS at 03 ml/min. However* this method uses 
0 afSnity cohmms ^polyckmal anti-RSA) that are very expensive to scak-iq). Also the protein is 
released from the cohmm wi& O.IM glycine pH 23 that will most probably, denature the 
protein. Therefore^ this mettkod can suitably modified to reduce diese drawbacks. 



The second me^od is for HSA extraction and puiification £:Dm potato tubers (Dr. 
Mingp-Castel's laboratory). After grinding the tub^ in phosphate bufier pH 7.4 (1 mg/2ml), die 
hamogenate is filtered in nmacloth and centrifiiged at 14.000 ipm 15 mimitBS. Afier this st^ 
anoter filtratim of the snpematant in 0.45 Om filters is necessary. Then, chn)matogrq>hy of 
ionic exchange in FPLC using a DEAE Sepbarose Fast Flow cohmm (Ameisham) is reqnixed. 
fractions recovned are passed throng^ an afGnity cohmm (Bine Sepbarose &st flow Ameisham) 
resnltmg m a product of high purity. HSA purification based on either method is acceptatde. 

IGF-1: All earlier attenqits to produce IGF-1 in B, coli or Saccharomyces cerevisiae have 
resulted in misfblded proteins. This has made it necessary to pofonn additional in yitro refoldiiig 
or extensive sqsaration techniques in asdes to recover die nattve and biological form of the 
molecule, bi addition, IGF-I has been demonstrated to possess an intnnsic ttiermod|ynamic 
foldmg problem widi regard to quantitativeiy folding nito a nattve disnlfido-bonded. 
confiocmation in vitro (131). Samuelsson et aL (131) and Joly et aL (132) co-expressed lOF-I 
widi specific proteins of £ coU diat significantly improved die relative yields of oooecdy folded 
protein and conseqaently fodlitating purification. Samuelsson et aL (132) fiised the proteui to 
afSnity tag3 based on either the IgO-binding domain (Z) fiom St^hylococcal proteni A or the 
two serum albumin domains (ABP) fiom Strq>tococcal protein G (134). The fiision protein 
cooxx^X allows the ICSF-I molecules to be purified by IgG or HSA aflSnity chromatography. We 
also use this Z tags for protein purification including the double Z domain fiom & auret4s protein 
and a sequence recogioized by T£V protease (see section d.2). The fusion protdn is incubated 
with an IgQ cohmm wbere bindmg via die Z domain occurs. Z domainrIgG mtetBCtian is very 
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as high a£5mty» so contaTninant protems can be easily washed ofif the cohnm 
Inciibation of the cohima with TEV protease elutes matuie lOF-I fiom the cohmtn. TEV protease 
is prodoced in bacteiia in lai^ quantities fused to a 6 histidine tag that is used for TEV 
pmification* This tag can be also used to sqtarate IGF-I fiom contaminant TEV protease. 

5 

IFN-D: Iq Hie £L coli eaqpiession method iised» the puiificatioD system was based on using 6 
Histkline-t^ that bind to a nickel coliinm and biotmylated thrombin to eliminate the tag on 
IFN-D5. 
ExanqikS 

10 CharacterizatiiMi of tlie recomMnaiit proteins 

For the safe use of lecombinant proteins as a replacement in anyr of Hu^ cunent 
api^cations^ these proteins nnist be structurally equivalent and must not contain abnomial host- 
derived modi£cati(H^ To confirm oon^ance with tibese criteria we compare human and 
recombinant protdns using fbs cuxrently highly soisitive and hig}ily resolving techniqnes 
15 expected by flic regulatory aathoritics to diaracteiiip recombinant products (135X 

Amino add analysis 

Amino add analysis to confirm fhs correct sequence is perfbnned following o£f-lme vapour 
phase hydrotysis using ABI 420A amino acid derivatizer with an on line I30A 

20 phenylthiocarbam)d-amino acid analyzer (Applied Biosystems/ABI). N-tecndnal sequence 
analysis is performed by Edman degradation using ABI 477A protein sequencer with an on-line 
1 20A pheny Ithiohydantoin-amino acid analyzer. Automated C-t^minal sequence analysis uses a 
Hewlett-Packard G1009A protein sequeiicer. To confirm the G*tBcminal sequence to a greater 
mimber of residues* tiie CMerminal tryptic peptide is isolated fiom tryptic digssts by reverse^ 

25 phase HPLC. 

Protein f pldiiig and disidfide bridges fonouation 

Western blots wifli reducing and noMeducing g^ are done to chedc pni^ 
to visualize small proteins will be done in the presence of tricine. Ptotem standards (Sigma) are 
30 loaded to compare ^ mobility of the recombinant proteins. PAGE is performed on PhastOels 
(Hiarmacia Biotech). Protems are blotted and fben probed with goat anti-HSA^ interferon alpha 
and IGF-I polyclonal antibodies. Bound IgQ is detected with horseradish peroxidase-labelled anti 
goat IgQ and visualized on X-ray film using ECL detection reagents (Amersham). 

35 Tryptic mapping 
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firm the presoice of chlooroplast expressed protdns with disulfide linkages 
identical to native himian proteins, the samples are subjected ta ttyptic digestion followed by 
peptide mass mapping using matrix-assisted laser desoiption ionization mass spectrometry 
(MALDI-MS). Samples aie reduced with ditfaiotfaieitol, allgrlated with iodoaeetamide and then 
5 digested with trypsm comprising dsee additions of 1:100 enrqrme/substiate over 48h at 37%. 
Sufasequoitly tryptic peptides are separated by leverse-phase HPLC ona Vydac CIS cohmm. 

Massanaly^ 

EiectrDBpiay mass spectrcmietiy (BSMS) is pedfoimed using a VG Qoattro electcospacay 
10 mass s|>eclnxmet^. Sanqdes aore desalted prior to ana^sis fay reveise-phase HPLC using an 
acetxmitnle gradicait containing tnfhiocoacetic acid. 

CD 

Spectra aie measured in a nitrogjen atmosphere using a Jasco J600 spectiopolarimeter. 

15 

Chromatograiililc techniques N 

For HSA, analytical gel-permeation HFLC is petfimned using a TSK G3000 SWxl 
cohmm. Ftqpaiative gel pemieaticm chramatogn^hy of HSA is pexfoimed usmg a Sephaayl 
S200 HR column. The mon<Mner fiaction, identified by absorbance at 280 nm, is diaiyzed and 
20 reconcentiated to its starting concmtration. For IGF-I, the leversed-phase chromatogr^iliy the 
SMART system (Fbaimacia Biotech) is used with the mRPC C2/IS SC 2.1/10 column. 

VlscD^ 

This is a classical assay for recomhinant HSA. Viscosity is a characteristic of proteins 
25 relaled direc% to di0irsi^,shape» and confoonaticiL The viscosities 0^ 

HSA can be measured at 100 mg. Ml-1 in 0.15 M NaCl uang a U-tnbe viscosirneter (M2 type^ 
Pouiton, Setfe and Lee Ltdl, Essex. UK) at 25%. 

30 CMoroplast proteins are not known to be glycosylated. However there are no publications 

to confirm or xefiite tins assun^on. Therefore glycosylalion should be measured using a scaledr 
up version of the method of Ahmed and Fuith (136). 

Example 9 
35 . Biological Assays 
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ISA does not have enasymalic activity, it is not possible to ran biological assays. 
However, three difTerBnt techniques can be used to check IGF-I functionality. Ail of theni are 
based on the proliferation of IGF-I responding cells. First, radioactive ^lymidine iq^take can be 
measured in 3T3 fibroblasts, that express IGF-I leceptor, as an estimate of DNA synthesis. Also, 

5 a human megakaiyoblastic cell line, HU-3, can be used As HU-3 gnnvs in suspension, changes 
in cell number and stimulation of {^ucose uptake induced by IGF-I are assayed using 
Alamai^lue or ghicose consumption, req>ectively. AlamacBhie (Accumed International, 
' Westlake.OH) is reduced by nutochondrial eD:^me activity. The leduoed form of tiie zeagenk is 
fhionesccEnt and can be quantitatively detect e d, ^th an excitatian of 530 nm and an emission of 

0 ^mn. AlamaiBlue is added to ftece]bfbr24 hours afl6r2da^ 

of IC3?-I and in the absence of serum. Glucose consumption by HU-3 cells is then measured 
uang a colorimetric glucose oxidase procedure provided by Sigma. HU-3 cells are incubated in 
the absence of serum with different doses of IGF-L Glucose is added for S hours and gbcose 
concentration is Hica measured in the siq^ematant All three mediods to measure IGF-I 

5 functionality aze precise, accurate and dose dqpendent, with a linear range between 0.5 and 50 
ngftnl(137). 

The method to deteonine IFN activity is based on their anti*viral pn^>erties. This 
procedure measures fte ability of IFN to protect HeLa cells against the cytopafliic effect of 
enoephalon^ocaxditis virus (EMC). The assay is performed in 96-well microtitre plate. Fhrst, 

0 HeLa cells are seeded in the wells and allowed to grow to confluency. Thi^ the mefium is 
removed, replaced with medium containing IFN dilutions^ and incubated for 24 hours. EMC 
virus is added and 24 hours latsr ib& cytopadnc effect is measured. For that, the medium is 
removed and wells are rirtsed two times with PBS and stained with medryl violet dye solution. 
The optical density is read at 540 nm. The values of optical density are f»oporttonal to die 

5 anttviral activity of IFN (138). Spedfic activity is determined with reference to standard IFN-D 
(code S2/576) obtained finnn NIBSC 

Example 10 

Animal testing and Pre-CaiDtcal THals 

D Once albumin is produced at adequate levels in tobacco and the physicochemical 

properties of the product correspond to those of tiie natural protein, toxicology studies need to be 
done in mice. To avoid mice response to the human protein, transgenic mice carrying HSA 
gnomic sequences are used (139). After injection of none, 1, 10, 50 and 100 mg of purified 
rec<Hnbinant protein, classical toxicology studies are earned out (body weigji and food intake, 

5 animal behavior, piloeredion, etc). Albumin can be tested ft^r blood volume replaceinent after 
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eUmmate tlie ihiid from the peritoneal cavity in patients with liver cttdiosis. It has 
been shown that albumin infusion aft^ this maneuver is essential to preserve e£fective 
dicolatory volume and renal functicm (140). 

IGF>1 and IFND are tested fin- biobgical effects snwi^ in aniiD^ Specifically, 

5 woodchucks (/mimom mo^iizx) infected v^lfa the ■ 
oonsidered as the best animal model of hepatitis B virus infecticm (141). Pieliminaiy studies have 
^lown a significant increase in 5' oligoad^ylate synthase RNA levels by real time polymerase 
dKon reaction (PGR) in wooddnick per^toal blood mononuclear cells upon incubation with 
human IFND5» a proof of tfaebiolo^cal activity of die human IFNQS in wooddnick odis. For 01 

0 studies^ a total of 7 woodchncks chronically infected wi^ snr&ce antig^ and 

WHV-DNA positive in serum) are used: 5 animals are itrjected subcutaneously wi& 500»000 
units of human IFND5 (the activity of human IFND5 is detmnined as described previously) 
three times a wedc for 4 mondis; the remaining two woodchudcs are injected with placebo and 
used as controls. Follow-up includes weekly serological (WHV sur&ce antigen and anti'WHV 

5 sur&ce antibodies by ELISA) and virobgicai (WHV DNA in serum by real time quantitative 
PGR) as well as monMy immunologk^ (T-helper responses against WHV sor&ce and WHV 
OQie antigens measured by intcfleukin 2 production fixim PBMC incubated wifli those protetDs) 
studies. Eboally, basal and end of treatment liver biopsies should be performed to secure liver . 
Inflammation and intrahepatic WHV-DNA levels. The final goal of treatment is decease of vixal 

0 replication by WHV-DNA in serum, widi secondary end points being faistDlogical inqoovement 
and decrease in intrahepatic WHV>DNA levels. 

For I0F-1» the in vivo thersqpeutic efficacy is tested in animals in situations of IGF-I 
deficiency such as liver cirriiosis in rats* Several reports (56-58) have been published showing 
dmt recombinant human IGF-I has marked beiKeficial e£fects in increasing bone and musde 

5 mass, impro v ing liver fimction and correcting hypogonadisnL BntOy, the indnctum protocol is 
as follows: liver drrfaosis is induced in rats by irihalation of carb<m tetrachloride twice a week 
fiir tl weeks, wxdi a progressively increaang exposure time fami 1 to 5 mimites per gassii^ 
session. After die 11*^ week, animals condnae recdving CCI4 once a week (3 mimiles per 
inhalation) to casBplet& 30 weeks of CX34 ad mlu lslratiiHL During die whole iiuloction period, 

0 phenobaibital (400 mg/L) is added to drinking water. To test the ther^peudcefBcai^ of tobacco- 
derived IGF-I, cirrhc^c rats receive 2 (ig^IOO g body wdght/day of Ibis compound in two 
divided doses, during the last 21 days of the induction protocol (weeks 28, 29, and 30). On day 
22, animals are sacrificed and liver and blood samples coUected. The results are compared to 
those obtained in cinhotic animals receiving placebo instead of tobacco-derived IGF-I, and to 

5 healthy control rats. 
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PRODUCTKXY OF HUMAN INSULIN IN TRANSGENIC TOBACCO 

FffiLD OF THE INVENTION 
This invention relates to prodaction of liigh value phannaceixtical pioteiBS in nuclear 
ttansgenic plants, pazticttlaily to production of human insulin in transgenic tobacco. 

BACKGROUND 

Rcscardi on human proteins in ihc past years has rcvohitionized tibe use of these 
thier^^eaticallyvahiablepiotenis ma variety of clim Since Ac demand fixr these 

proteins is expected to increase considerably in ttie coming years, itwuldbewiseto rasurefliatin 
tbefuturethey ¥riU be available in dgnificantlylBisiBr^ oo a cost-effective ba»s. 

Because nx>st genes can be esqxressed in niany difierent system 

system offers the most advantage for the manu&cture of the recombinant protein. An ideal 
expression syst^ would be one that produces a maxunnm amount of safe, biologically active 
matenal at a TrrimnniTn cost The use of modified mammalian cells with recombinant DNA 
techniques has the advant^ of resulting in products, which are closely related to those of natural 
origin. However, Gulturing these ceUs is intricate and can only be earned out on linute^ 

The use of microocgpniams such as bacteria permits manu&ctnre on a larger scale, but 
introduces the disadvantage of producizig products, which differ appreciably fixmitlie products of 

20 natural origin. For example, proteins that are usually glycosylated in humans are not glycosylated 
by bacteria. Fttcttiennore, human protrans tfiat are esqxressed at high levels in E^coU fiequently 
acquire an unnatural conformatian, accompanied by intracdhilarprecipitation due to lacko^pzoper 
folding and disilfide bridges. Production of recombinant proteins in plants has many potential 
advantages for generating biopharmaceuticals relevant to clinical medicine These include the 

25 following: (i) plant systems are more economical than industrial &cilities using feim^itation 
systems; Qi) technology is available for harvestii^ and processing plants/ plant products on a largp 
scale; (iii) eliminatioa of the purification requizement when the plant tissue containing the 
recombinant protein is used as a food (edible vaccines); ^)p]antB can be directed to targ ptp K )l Bms 
into stable, intraceUnlar compartments as diloroplasts, or expressed direcdy in dhloroplasts; (v)tiie 

30 amount of recombinant product that can be poduced approaches industtial-scale levels; and (vi) 
health risks due to contamination with potential human pathogens/toxins are mmunis^d* 

It has been estimated that one tobacco plant should be able to produce moie recombinant 
pcotem than a 3004iter fcrmeotcr of KcoH (Crop Tech, VA). In addition, a tobacco plant can 
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pi:Ddm:eaiidlHoiiseeds»&ci]itati^ Tobacooisalsoanidealcfaoicebecause 
of its felative ease of genetic man^nilatton and an in9)endingneed to explore ahemate uses fi>r this 
hazaidous crop. However, with the exception of enzymes (e.g. pbytase), levels of foieigp proteins 
produced in nuclear transgenic plants are generally low, mostiy less than 1% of the total soluble 

5 protein (KnsnadietaL 1997). May etaL (1996) discuss this problem using the following exanq)les. 
Although plant derived recombinant hepatitis B surface antigen was as effective as a commercial 
recombinant vaccine^ tiie levels of expression in transgenic tobacco were low (0.0066% of total 
sdbbie protcinX Even though Norwalk virus capsid protein expressed in potatoes caused oral 
immunizaticm when consomod as food (ediUe vaccine), esqpicssioa Icvds were low (03% of total 

10 scdubleprotdn). 

In particular, expression of fauroan protdns in nudear transg^c plants has been 
dis^)pointingtylow: e.g. human IntCTfeion-p 0.000017% of fresh wd^it, himi an se n t m albumin 
0.02% and erydnopoietin 0.0026% of total soluble protein (see Table 1 in Kusnadi et aL 1997). A 
synth^c gene coding for the human qpidermal growth factor was e^qnressed only iq> to 0. 00 1 % of 

15 total sohible protein in transgenic tobacco (May etaL 1996). The cost of producing recombinant 
pmteins in al&l& leaves was estimated to be 1 2-fold lower dian in potato tiib^ and comparable 
with seeds (Kusnadi etal 1997). However, tobacco leaves are much larger and have much higher 
bicHnassthanalMb. Planet Biotedmology has recently estiinatedlhatatSOrng/hterofinanm 
cell culture <h* transgenK: ffxiits waSk or 50mg/kg of tobacco leaf expression, the cost of pun&d IgA 

20 win be $10>000, 1000 and 50/g; respectively Q>aniell et aL 2000). The cost of production of 
recombinant proteins will be 50-foId lower than that of KcoU fommtation (with 20% esqnvssicm 
levels in BwOoli^OESIjisDadietaL 1997). A decease in insulin e]qn:esskHififQm 20% to S%ofbiomass 
doubled thecostofprodnctianiniS'.coA'(PetridisetaL 1995). Expression level less than 1% of total 
soluble protein in plants has been found to be not commercially feasible (Rjisnadi et aL 1997). 

25 Therefore, it is important to increase levels of esqrression of recombinant proteins in plants to aq^t 
plant production of phaiinBcologically important proteiiis. 

An alternate q^proach is to expiess foreign proteins in chloniplasts ofhig^er plants. We have 
recently integiBted foret gn genes (up to 1 0,000 copies per ceU) into the tz^^ 
resulting in accumulation of recombinant proteins up to 46% of tihe total soluble protein (De Cosaet 

30 aL2001). Chkroplasttransfinmation utilizes 

recombination, insert foreign DNA into the spac^ region between tiie functional genes of the 
chkjroplastgpDonie, thereby taxgpting&efiarcignge This eUxninates die 
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ApositioD effect® and gaae sileaciiig fieqoently observed in nuclear transgenic plants, Ghloroplast 
genetic engineering is an e avir o nm entalty friendly approach, minimizing concents of out-cross of 
intiodiu^tiaitsviaponeatoweedsorodLercr(^(6odcandI&gimam Also, 
the concerns of insects dev^oping resistance to biopestiddes are nmmnized by hyper-esqxiession of 

S single insecticidal proteins (high dosage) or expression of different types of insecticides in a single 
transfoimation event (gBoe pyramiding). Concenisofiiisecticidalpn>teins an non-taiget insects are 
minhnized by lack of e xpitasi on in transgenic pollen (De Cosa et al. 2001). 

Importantly^ a significant advanti^ in the production of phannaceotical proteins in 
dblocoplasts is tiieir alriUfy to process eukaxyotic proteins, including folding and formation of 

10 disulfide bridges (Drescberetal. 1998). Chapa:onin {HOtdns aiepivsiaitin chkoiiplasts (Roy» 1989; 
Vieding, 1991) Ifaat fimction in folding and assembly of pnslcaiyot^ Also, 
proteins are activated by di sulfide bond oxido/tednctjon cycles nsing the cbloroplast tfaioredaxin 
system (Renlland and Nfiginia&Masbw, 1999) or chtorDplastptDtein disnlfidft iscMnerase (Kim and 
Mayfield^ 1997). Accomnlation of folly assembled, disulfide bonded form of fanman somatotropin 

15 via chlorc^last transformatiQn XStaub et aL 2000), oligomenc form of CTB (Heniiqnes and Daniell, 
2000) and foe ass^nbly of heavy/Hght chains of humanized Ou/s 13 antibody in transg^c 
cbloroplasts (Panchal et al. 2000) provide strong evidence for successfol processing of 
phamiaceuticaipax>teins inside diloropiasts. Such folding and assembly should eliminate the need 
for hig^ e^^eosive in litro processing of phamiaceutLcal proteins. For racample, 60% of the total 

20 operatingcostintlieprodnctionoffannianinsulinisassodatedwilhm 
disulficte bridges and cleavage of metfajonin^ Petridis et aL 199S), 

^ Anotherm^or cost of insulin produdlQn is purification. Chramatc^gtq[)i^accoiints for 30% 
of o p c mliu g expenses and 70% of oq[lI^mlcnt in producticin of insulin (Petridis et aL 1995). 
Therefore^ new approaches are needed to minimize or eliminate cfaroma-togrq>hy in insulin 

25 productioiL One sudi approach is the use of (iVGVP as a fusion protein to fodlitate sLo^e stqp 
purification wifoout the use of chromatogr^y. GVGVP is a Protein Based Polymer (PBP) made 
fiom synthetic genes. At lower tranperatures this polynia exists as inoreextendedmolecule^ l^>on 
raising the temperature above the transiti(»i range, polymer hydrophobically folds into dynamic 
structures called p-spirals that forther aggregdte by hydrophobic assoctalioii to form twisted 

30 filaments([^, 1991;UiryietaL, 1994). Inverse tBnq)erHtmstzansitian offers sisveral advantage. 
It ^dlitatesscateiqiofpuiification fiom grams to kilograms. Milder purification comdifion requires 
ordyamodestchaiigB in tranperatureand ionic strength. This should also fecilitate higher recovery. 



wo 01/72959 



116 



PCT/US0iy06288 



1465-PCrr-OO (1577-P-OO) 

&ster puii&^tim and Ugh vohme ps^^ Ttotdn purificadon is geaeraOy the slow stq^ 
(botaeneck)inp3iaiiDaGeuticalprodQ^ Thioug^e:q>loitatianofttns]»veisfl>teiiive^ 
tetxq>eratuie transition property, siiiq>Ie and ine3q>ensive extraction and puii&cation may be 
perfonned« The temperature at ^gvidch. the aggregation takes place can be mampulated by 
5 eogineenng biopolymers containing varying numbets of repeats and changing salt concentration in 
sohitiGn (McPherson et aL, 1996). Chloroplast mediated expression of tosulin-polymer fiision 
pn^in should eliminate the need for the CTq^ensive fermentation process as wdl as reagents needed 
for recombinant protein punfication and downstream processing. 

Oral deliv^ of insulin is yet aiK)therpowerfi]I ^proadi tfiat can climinatci^ to 97% of 

10 productioncostof insulin OP^itndisetaL 1995). For exaoqple^ Sun etaL (1994) have diown that 
feeding a small dose of antigens conjugated to the receptor binding non-tmde B sobumt moiely of 
the cholera toxin (CTB) suppressed systemic T ceH-^nedi ated inflanrmatory reactions in ammals. 
Oral admmistration of a myelin andgen conjugated to CTB has be^ shown to protect animals 
against encephalomyelitis, even vtdien given after disease induction (Sun et al. 1996). Bergprot etaL 

15 (1997) reported that feeding small amounts oflniman ins»ilin conjugated to CTB suppressed beta cell 
destruction and cliiiical diabetes in aduh non-obese diabetic (NOD) mica The protective effect 
could be transferred by T cells fiom CTB-insuhn treated animals and was associated with reduced 

insnlitis. Thesa rBmiHR rfrnnnnKtrnte tftat pm^agHnm againgfr nntnitrnninift dinKfttan asm triHaftH V>ft 

achieved by feeding small amounts of a pancreas islet cell auto antigen linked to CTB ^ergerot et 
20 aL1997)> Coi^agation with GIB facilitates antigen ddiveiy and presentato 

Lynqphoid Tissues (GALT) due to its afSnity for tiie cell sui&ce receptor GM|-gangliosfde located 
on GALT cdls» fer increased iq^take and immunologic recognition (Arakawa et aL 1998). 
Transonic potato tubers c^cpressediq) to 0.1% CTB-insuUn fusion protein of total sohbleprotrin, 
wMch retained (j^i--gaDgUoside binding afBnity and native antogenicityf^ gndrnjaiKn, 

2S >fnn rmm fftrf with transgftmr pntatAfiibCTR pnrttann'rtgnriiCTngratn ^jn^"^'*^ of CTFMg?ffHn filsion 

pn^in showed a substantial reduction in insuhtis and a delay in the progression of diabetes (Arkawa 
et al. 1998). However, for commennal oqtloitation, the levels of expression should be incr^dsed in 
transgenic plants. Thetefior^ we propose here esqxression of CIB-insulin fu^ 
cbl<»oplasts of nicotine free edible tobacco to increase levels of expression adeqpiate for animal 
30 testing. 

Taken togedier, low leveb of esqnession of human proteins in nuclear transgenic plants, and 
di£Bculty in foldinig^ assembly/processing ofhuman proteuis iaE^coli should make chloioplasts an 
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diloroplasts should also dracoaticaUy lower die pix)ducticm cost Laige-scale production of insuHn in 
tobacco in conjunction witti an oral deliveiy Systran can be a poweifUi approach to provide tieatment 
to diabetes patients at an affordable coat and provide tobacco ^samsts alternate uses for this 
5 hazardous crop. Therefore, it is first advantageous to use poly(GVGVP) as a fusicm protein to enable 
hyper-^expression of inailin and accomplish rapid one step purification of the fusion peptide utilizmg 
the inverse tranperaturetransitifm properties of diis polymer. It is finther advantageous to develt^ 
insolin-CTB fusion protein &r oral delivoy in nicotine fiee edibk tobacco (LAMD 605). 
Bodi addevemeats can be acc(mq>lished as fdkyws: 

10 a) fVvi^lnp rftft^Ki'mmt HMA vt^tora for enhanced exprBMrinn nfPrnnwnlin aa fiiRtnn pmteifMt 

with GfVGVP or CIB via dilofoplast genomes of tobacco; 

b) - Obtain transgenic tobacco (Petit Havana & L^^ 

15 c) Charactedze transg^c expression of proinsulin polymer or Cro 
mdecular and biochemical mediods in chloroplast^ 

d) Employ existing or modified methods of polymer purification fitmi transonic leaves; 

20 e) Analyze Maidelian or maternal inheritance of transgenic plants; 

f) Laigp scale purificatLon of insnHn and CQn]|>arison of current insnlin pnrificatioii mediods 
vnfh. polymer-based pnrification meOaodhiRcott and tobacco; 

25 g) Qinq»aie natural refolding in ddoroplasts with mW/n^pi^^ 

h) Charactecization (yield and purity) of proinsulin produced in^^^ 
and 



30 i) 



Assessment of diabdic symptonis in inice fed with edible tobacco expr^ 
fiision protein. 
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Diabetes and InsuBn: Ilie most obvioiis action ofuisalm is to lower blood 
1973). Tbis is a zesuh of its inmiediateefifect in increasing Innmscle,^ 
under tbe action of insulin^ ghicose is more readily taken and eidiar converted to ^cogen and 
5 lactic acid or oxidized to carbon dioxide. Insulin also affects a number of important enzymes 
concerned widi cellularmetabolism. It increases the activity of ghicokmase, wMch phosphorylates 
^uco66» thereby increasing the rate of glucose metabolism Lisnlin also si9)presses 

g^nconeogeoeas by depressing the function of liver enzymes^ wbich openite die reverse padiway 
fiom proteins to ^acos& Lack of insulin can restrict the transport of ghicose into muscle and 

10 adipose tissue, lliis results in tnmases in blood ghicose levels (hypeig^ Ih addttion^tfae 
breakdown of natural &t to fiee fitty adds and glycol is increased and there ^ 
add content in the blood Increased catabolism of &tty adds by the Irver results in greatsr 
production of ketone bodies. They difinse from the liver and pass to die musdes far fiirtfaer 
oxidation. Soon, ketone body production rate exceeds oxidation rate and ketosis results. Fewer 

15 ammo acids are taken by the tissues and protein degradation re^ilts. At die same time, 
^uconeo^nesis is stimulated andprotein is used to produce ghicose; Obvious^, lack of insolm has 
sedous consequences. 

Diabetes is classified into types I and IL Typelisalsoknomtasinsolindependentdiaibetes 
melhtus QDDM), Usually dus is caused by a cell-mediated autoimmune destruction of d^ 
20 pancreadc p-cdls (Davidson, 1998). Those suffering fiom this type are dependent on extemal 
sources of insulin, l^pe n is known as noairBulin*<lq)endent diabetes indlitus This 
usually intvolves resistance to insulin in combinatLoa with its un These pronnneDt 

diseases have led to exten^ve research into microbial production of reoombinant hmnan fnsnlin 
(rHI). 

25 

Expression of Recombinant Human Insulin in KcoGi In 1978, two thousand kilograms of insulin 
were usedin the world each year,halfofdus was iBedin the UnitedStates(Ste]ii» At 
diat time, the nmnber of diabetics ini the US was UMsreasing 6% every year (Gnnby, 1978). In 1997- 
98» 10% increasemsales of diab^ care products and 15^ increase in insulin producte 
30 rq)ortedbyNovoNmlisk,makingita7.8bi]tiondol]arindus^ 

killed by diabetes, makingit the fourdi leading cause of death. Maz^metfaods of production of rlfl 
have been developed. Insulin genes were first chemically syntfaesizBd for ei^ression in Esheridm 
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co&(QneaetaL,1978). These geaaes encoded sqwiate insiilin A and B chains. Hie gates weieeadi 
e^q^iressed in £lco/i as fusion prol^ Thefiist 
documented production of iHI using this system was reported by David Goeddel from Genentech 
(Hall, 1988). The ^Boes were fused to the Trp synthase g^wMch]«sul^ 

5 yield, due to tiie smaller fusion peptide. This fusion protein was approved for cotmnercial 
fHoductionby]^ Lilly in 1982 (Qiance and E'rank, 1993) with a product name of Humulin. As of 
1986, HuDmlin was produced fbomproiiisalin genes. Proinsulin contains both insuUnchams and the 
Cipeptide that connects them. Normal in vUro post-translational processing of proinsulin inchides 
useoftiypsinandcaEboixypeptidaseBfbrmBtDnitiQatoinsu]^ Other datacanccmmgc omui ci ria l 

10 pcoduGtionofHornidinaiidodierinsolinpxoducts is now considered proixnetary mfisnnation andis 
not available to &e public. 

Protein Based Poiyiiiefs(PBP): The synthetic gene that codes for a bioelastic PBP was designed 
after repeated aniiiio add'sequencesOVGfVP,observBdinaUseqiiffwy^mammnatiane^^ 
(Yeh et aL 1987). Elastin is one of ttie strongest known natural fibers and is present in skin, 
15 ligaments, and arterial walls. Bioelastic PBPsccmtainirig multiple rq)eats of this pentamer have 
remarkable elastic properties, enabling several medical and rK>n-medical applications (Urry et al. 
1993, Urry 1995, Daniell 1995). GVGVP polymers prevent adhesions following surgery, aid in 
reconstrocting tissues and dehv^mg drugs to the body over an tended period of time. Nordi 
American Science Assodates, Inc. tepaited that GVGVP polymer is non-toadc in mice, non- 
20 seDsitizmg and non-anti£^c in gmnea.pigs, and non-pyrogenic in rabbits (Urry et al. 1993). 
Reseaichera have also obsoved that inserting she^ of GVGVP at the ates of contammated wounds 
in rats reduces the number of adhesioiia that fonoi as the woimds heal 

maimn; using die GVGVP to encase muscles that are cut during eye sorgoy in raUnts prevents 
scarring following the opmtion (Urry et al. 1993^ Uny 1995). Other medicdl q^pHcations of 
25 bioelastic PBPs include tissue reconstruction (synthetic ligaments and arteries, bonesX wound 
coverings, artificial pmcardia, catheters and pn^ammed drug delivery (Urry, 1995; Urry et aL, 
1993, 1996). 

We have expressed the elastic PBP (GVGVP)i2i mExoli^Qodst et al. aL1995, Brixey et aL 
.l991),mihG^S^A^>er^UusniduIans(B^ 1997),incultQrBd1obacoo cells (Zhang etaL 

30 1995), and in transgenic tobacco plants ^Sungetal. 1996). In particular, (GVGVP)i2i has been 
expressed to such hig^ levels in Exoli HOBt polymer inchision bodies occiq>ied \sp to about 90% of 
the cell volume. Also, imdnsion bodies have been observed in diloroplasts of transgeruc tobacco 
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plants (see Darnell and Guda, 1997)* Recent^, we rqxiited stable tiansfbi^^ 
chloroplasts integialion and esqpiiessk^ 

region (5^000 copies per cell) or die Liverted Repeat i^on (10, 000 copies per cell) of fbe 
ddoroplast genome (Gxida et al^ 2000). 

5 

PBP as Fusaon Proteins: Several systems axe now available to siinplify protein puiification 
inchiding the maltose binding protein (Manna et aL 1988)» gfaitothione S^ransfeiase (Smith and 
Jdmson, 1988Xbiottnyiated(Tsaoe^aL 1996)»tfai<ttedoxtn(SniithetaL 1998)andccMo66Unding 
(QttgetaL 1989)pratein3. RecombinantDNAvectoEafbrfbsionwithato 
10 effectivelyiitilize aforementiofned fiision proteins in the potification pn)cess (SmidietaL 1988; Kbn 
and Raines^ 1993; Su et aL 1992). Recombinant proteins axe genemUy pmified by affinity^ 
cliFcxnat0grapi]y» using lig^odsspedfic to carrier protd Wbile these aie 

useful techmqnes for laboiatQiy scale purification, afOnity daomatogn^hy for large-scale 
puii&ation is time consuming and cost prohibitive. Theiefbce^ economical and non- 
15 chn)matogr^hic techniques are highly desirable. In addition^ a common sohiticHi to N-tenninal 
degradationofsmaUpq)tides is to fuse foreign pq>tidestoendogenous£.co/iprotd^ Ead/inthe 
development of this technique^ ^-gaiactosidase (p-gal) was used as a fosion protein (Goldberg and 
Gof^ 1986). A drawback afttiismefiiod was&atlfae P-gal protein is of relatively highmolecnlar 
weight (NiW 100,000). Therefore, the p!n)partionofthe peptide product in Ihe total prote 
20 Anodm^problem associatisd with die large p-gal fosion is eaily termination of translation (Bumette, 
1983; Hall^ 1988). ThisoocuxredwhenP-galwasusedtoproducehimianinsuMpepti 
the fosion was detached ficmi foe ribosome during translation thus yielding incomplete peptides. 
Other ppoteins of lower molec^dar weight proteins 

pqitide production. For example, better yields were obtained with the trypto^ian synfoase (190aa) 

25 fosion proteins (Hall, 1988, Bum^ 1983). 

One primary advantage of tiiis invention is touse poly(GVGVP) as a fosicmprotdnto enable 
hyper-aquression of insulin and acconqpltsh r^id one step purification of the fosion peptide. At 
lower temperatures the polymers exist as more extended molecules which, onndsing the temperature 
above foe tran^on range hydrophobicaUy fold into dynamic structures called P-spoials foat forfher 

30 aggregate by hydrophobic association to fom twisted filaments (Urry, 1991). Hmmj^ exploitation 
ofthis reversible property, simple and inexpensive extraction and purificati^ The 
tempeiature at whidh aggregation takes place (TO can be manipulated by engineering biopolymers 
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oanlainiiig varying numbeis of repeats or changitig salt coanoentcatioD (McPhetson et aL, 1996). 
Another group has lecenHy demmistzated panfic^on of recombinant proteins by fusion with 
tbennally re^xmsive polypqytides (Meyer and Chillooti, 1999). Polymers of (fifferent sizes have 
been syntiiesized and ex|Nressedin£c0i2. This aptmach can also eliminatft the need for eayeasive 
5 reag^ts, equipment and time required for purification. 

Cholera Toxin p subonit as a.ffision protein: Vibrio choleras causes diairheaby colonizing the 
small inte^e and piodocfng enteix)toxins> of which the diolera toxin (CT) is considered die main 
cause of toxidty. CT is a hexamenc AB3 protein having cme 27KUa A snbunit winch has toxic 
ADP-jibos^ fzansf^ase activity and a non-toxic pentamer of 11.6 1^ 

10 nmialflntly ItnlpftH ififA » yyfy sfe|b1? ^ftnghn^ B)cft stpicfaim intn urhtrli tl^y. tr>ylfi ^i^^tnift (A) gnhrnnt is 

inserted. TheAsubimitof CT consists of two fiagments-Al and A2 which are Im^ 
disolfidebond. Theen^maticactivityof CTislocatedsolely wdie Al &agmeafTt(Gill, 1976). Hie 
A2fiagmentoflbeAsvibimitlinksfheAl fiagmentandtfaeBi^ CTbin^ viaq[iecific 
interactions oftheB subumtpenianierwilliGMl g^nglioside, the membrane receptor^ present on the 

15 intestinal ^ididial cell scErface of the host The A subunit is then translocated into the cell v^diere it 
ADP-ribosylates die Gs subnnit of adenylate cyclase bringing about the increased levds of cyclic 
AMP in afifected cells that is associated with the electrolyte and fhud loss of clinical cholera (Lebena 
etai 1994). For optunal enzyinatic activity, the Al fragment needs to be separated &om the A2 
fragment by proteolytic cleavage of the main chain and by reduction of the disulfide bond linking 

20 them (MekalanosetaL 1979). 

Expression and assembly of CTB intransgemcpotalo tubetslmbeairqpo(ted(Arakawa et 
aL1997). Ibe CTB gene including die leader peptide was fiised to an endq)Iasniic xelicobnn 
retentiQn signal (SEKDEL) at the 3^ end to sequester the CTB protein within the huncn of die ER. 
The DNA fiagment encoding the 2 l-aniino acid leader peptide of die CTB protein was retained to 

25 direct die newly synthesized CTB protein into die hmien of the ER. Tmmimoblotanalyas mdicated 
that the plant derived CTB protein was antigenically indistingpishable from the bacterial CTB 
protein and that oligomeric CTB molecules (Mr ^ 50 kDa) w^ the dominant molecular species 
isolated fiom transgenic potato leaf and tnbertissues. Slmilarto bacterial CIB,p]antderived CTB 
dissociated into mononiecs (hlr^lS IdDa) dudng heat/add treatment 

30 Enzyme linked iimmmosorbCTtt assay methods indicated diat plant synthesized CIB protein 

bound q>ecifically to GMl gangUosides, die natural membrane receptors o The 
maximum amount of CTB protem detected in auxin induced transgenic potato leaf and tuber tissues 
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wa8qpim)xhnatBiy03%oftfaetotaI soh^ llie oial iimiimiizatkm of 

transgenic potato tissues transformed the CTB gene (admiziistered at weddyr intervals for a 
montti mdi a fina] booster feeding on day 65) has also The levels of senm and 

mncosal anti-cholera toxin antibodies in mice were found to generate protective innmmity against 
S the cytqpatbic effects of CTholotoxin. Following intraileal injection witii CT, theplant iTrmmiized 
mice showed iqi to a 60% reduction in dianheal fhiid accumulation in the small intestine. Syst^nic 
and mucosal CTB- specific antibody titers were detennined in bodi senxm and feces coDected fixnn 
immunized mice by the class-specific diemituminescent ELIS A mediod and the enci^int titers for 
tbethre6antibodyisotypes(IgMJgGandIgA)weR;det6rnuned ThecactentofCTneotralizationin 

10 bolbVeiocen and fled loop e7q;)erQnents suggested duitanli-^^ 

f^lhilstrfiMT-gsmgKngtAsg Also, mice fed With 3 gof transgenic potatpexhibitBdgimito 
protection as mice gavagied with 30 gofbacterial CTB. Recombinant LTB[iLTB] (the heat labile 
enterotoxm produced by Enterotoxigpiic Kcolf) which is stnictnrally, functionally and 
inmnmologjcally similar to CTB was eaqHessed in tiaiisgemc tobacco (Ai 

IS 1995). They have reported tha^ the rLTB retained its antigiMiicity as shown by innmmoprecqiitation 
of rLTB with antibodies raised to rLTB firom E,coli, The rLTB protein was of the right molecular 
weight and aggregated to form the pentamer as confirmed by gel permeation chromatography. 
Delivery of Human Insulin: Insulin has been delivered intravenously in the past several years. 
However^ mora recently^ altemate melhods such as nasal spray, are also available. Oral delivery of 

20 insulin is yet another new appioadi (l^thiowitz et al., 1 Engineered polymer microspheres 
made of biologically enxfable polymera» which display sbrang inteiactioas widi gastroinCestinal 
mucus and cettnlar linh^, can tnrvnse both mucosal aibsoiptive ^[lilhBlium and fto folHde^ 
associated c{iitticKtnn,covG!n]|g die l)nnqpho^ Polymers maintain contact 

with intestinal epididium for extoided periods of tune aiid actna% 

25 cells. Animals fed witii die poly OPA: PIXxA)-encq)Sulatedinsulinpr^aratLonwereable to regulate 
the ghicose load better than controls^ confiiming that insulin crossed the intestinal barrier and was 
released fiom the microspheres in a biologically active form (Madiiowitz et aL» 1997). 

Besides, CTB has also been demonstrated to be an effective carder molecule for die 
induction of nmcosal innmunity to polypq>tides to which it is chemically or geni^ically conjugated 

30 ^dcKenzie et aL 1984; Dertzteug^ et aL 1993). The production of immunomodulatary 
transmucosal carrier molecules^ sudi as CTB, m plants may greatly inqxrove the efBcacy of edible 
plant vaccines CBdqetaL 1995; Thanavala et aL 1995; Mason et aL 1996) and may also provide 
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novel oia] toleraoce agentsfixTpieveat^ 

aL l991XRb»nz2atDidaxthiitis(T^^ 199(^Afiller 
etal. 1992; Werner etaL 1993)asweUa&theisrevexitionofaUeigica]ida]logr^ 
(Say^etal. 1992; Hancock ^aL 1993). Theiefb(e,e7gMressiiigaC^ 
s aa ideal ^){HX>ach for oral deUvery of insoli^ 

CMoroplast Geaedc Engmeering: Vifhai we developed die conc^t of chlon^last genetic 
engmeoriTig (Daniell and McFadden* US. P&teots; Danidl, World Patent 1999X it was 
possble to inttodttce isolated intact ddodoplasts into protoplasts and rpgenecate tnui^enic plants 

10 (Cadson, 1973). Theiefixce, eaxly investigatioDS on cfal(»oplast tiansfbnnation focused cm die 
development of in arganeUo systenis using intact chkm^lasts citable of efficient and prolonged 
tiansoqition and translatiozi (Danielland Rebeiz, 19S2; DameQ et aL, 1983» 1986) andexpiessioiiof 
fiseigngieRes in isolated (Mm^la5ts(DanieU^ However, afterlhediscQveiy 

of the gene gun as a transformation device (Daniell, 1993), it was possible to transfiimn plant 

15 chlorofdasts wi&out the use of isolated plastids and pn>tq)lasts. Chloioplast genetic engioeering 
was accomplished in several i^iases. Transient expression of foreign genes in plastids of dicots 
(Paniell et aL» 1990; Ye et aL, 1990) was followed by such studies in monocots (Darnell et al.» 
1991). Unique to tiie chloioplast genetic engmeering is the develojHnent of a foreign gene 
expression system usuig autonomously iq[>licating chloroplast expressicHi vectois (Daniell et a]., 

20 1990). Stable integration of asdectablemailxr gene into die tobacco chloi^ 

Maliga, 1993) was also accomplished usir^ die gene gun. However, useful g^mes confeanring 
vahmUe traits via ddorpplast genetic mgineedng have been demonstrated only icoaitly. For 
exanqple, plants lesistani to B.t sensitive insects were obtained by integrating the ayMc gem into 
the tobacco chloroplast gencmie (McBride et aL, 1995). Plants resistant to B.t resistant insects (up 

25 to 40,000 fohQ were obtained by hyper-esqnression of the cryUA gene within the tobacco chlon^)last 
genome (Kola etaL, 1999). Plantshavealsobeengenetically engineered viathechloroplastgBDome 
to conftr betbicide resistance and the introduced foreign genes were maternally inhedted, 
overcoming the problem of out-cross with weeds (Daniell et aL, 1998). Oilorqplast genetic 
eogineering has also be»i used to produce phaonaceutical products that are not nsed by plants 

30 (StaubetaL2000»OudaetaL2000). CUoioplast genetic engineeringtecfanology is cunenttybetng 
q)plied to odior useful crops (Sidorov et aL 1999; Daniell, 1999). 
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SUMMARY OP INVENTION 
Ibis inv^cm syn&Bdzeshie^ value itom^^ in nucleartrans^c plants by 

cAlon>plaste3q)iessionf(Mrphannaceiitical|^ Chloroplasts are suitaWe for iMs 

5 purpose because of ^eir ability to process eukaryotic proteins, including folding and fonnation of 
disulfide bridgf^, t^fir^^V ftliminflrin g the need for expensive post-purificatioa proccsang. Tobacco 
is an ideal choice for this purpose because of its large Woniass, ease of scaienqj (m 
plant) and gpnrtic manipulation. We use poIy(GVGVP) as a fusim protein to enable l^per- 
expression of insulin and acccniq>lish ispid one stc^ purification effusion pqstides utilizing tbe 

10 inversetemp6nitaretransittonpiq;)6rtiBSofthisp<dyn^ Wealsouseiiisulin^JrBfiiffliHiiTOtBb 
chloroplasts of nicotine fiee edibiB tobacco (LiVMD 605) for oral deHvecy to NOD mice. 

mmp PRS CRIPTEON OP DRAWINGS 
Fig, 1 shows graphs of Ciy2Apiotdn ccmcentratiQiidetennined by 

IS leaves. 

Kg. 2 is an innmnogold labeled electron microscopy of a matore transgenic leaf 

Fig. 3 contains i^tographs of leaves infected with 10 of 8x10^, 8xlO\ 8x10^ and 8x10^ 
cells of P. syringae five days after inoculation. 

Fig. 4 is a graph of total plant protein mixed with S |U of mid-log phase bacteria fiom 
20 ovemi^cuttute^ incobated Air two hours at 2500 at 125 ipm and grown in LB broth ovemig^ 

Fig. 5Ais agii^hof CTB EUSAqpnntificationshownasaperceirtago of t^ sofaible plant 
protein. 

Fig. SB is a gc^ of CTB-GMl (^gUoside bindii^gEIJSA assays. 
Fig. 6 is a 1 2% reducing PAGE using Chemiluminescent detection witb rabbit anti-cholera 
25 seram (1 □) and AP labeled mouse anti-rabbit IgG (2D) antibodies. 

Fig?. 7AaiidB showreducinggelsofexpressionaiuiassemblyof disulfide bonded Guy==s 13 

monoclonal antOxxfy. 

Fig. 7C shows a noonreducing gel of expression and assembly of disulfide bonded Giqfs 13 
monoclonal antibody. 

3D Figs. 8A - F show pihotograite compaiiiig betaine aldehyde and spectinoniycin selection. 

Fig3. 9A and B show biopolymer-proinsulin fusion protein expression. 
Fig. lOA shows western blots of biopolymer-proinsulin fusion protdn aft^ single step 
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Fig. lOBsl^w western blots of anotiierUopofyine^^ 
steppurificatioii. 

Fig. IOC shows westm blots of yet aiiotobiopolynierf^ 
5 steppurificatioa. 

Fig, 1 1 shows biopolymer-pioinsalin fijsion gene integratioii into tiie cidoroplast genome 
conformed by Soutfaem blot analysis. 

DETAILED DESCRIPTLON 

10 Aiemadcablefeatiixe of chknoplast genetic • 

large accnmnkitian of fbie^ pEOteios m This can be as mucii as 46% of CRY 

pcotHnintotalsoliibte protein, eroiiinbleachedoldle^^ Stable eqiressicxi 

of a phaimaceutical protein in dhloioplasts was fbst leported for GVGVP» a protein based polymer 
wi4i varied medical sq[>plicatiQns (socbas thepreventioQ of post-sinsjcal adhesions and scais» wound 

1 s coyerings, artificial pericardia, tissue reconstniction and programmed drug ddiveiy) (Ouda et al. 
2000). Subsequently, expression of the human somatotropin via the tobacco diloroplast genome 
(Staub et al. 2000) to hi^ levels (7% of total soluble protein) was observed The following 
investigations that are in |MX>grBS8 illustrate &e power of this technology to express small peptides, 
entire opmns, vacdnes that require oligomeric proteins with stable disulfide bridges and 

20 manocloiials that require assembly of heavy/Ughtchaiiis via cli^)en>^^ It is essential to develop a 
sdectum system free ofanltbioticresistaiit gates for tiiee^le insult * 
One such marker fiee dbloroj^ast transformation system has been accmnpKdiBdCDanidl et al. 2000). 
E xpet im eats are in p rogre ss to develop dikxroplast transformatioiQ of edible leaves (al&]& and 
lettuce) fiir die practical applications of Ibis i^proack 

25 

Engineering novd pathways via the chloroplast graome: In plant and animal cdls» nuclear 
mRNAs are translated monodstronicaliy. This poses a serious problem when engineering multiple 
genes in plants (Bogorad, 2000). Therefore, smgle genes were first introduced into individual 
transgenic plants, thea these plants were back-crossed to reconstitute the entne pathway or die 
30 completeproteintoejqHfessfbepol^iiydiDzybutyratepolymeTorGiq^ 13antiboify(Navraib^aL 
1994;Maetal. 1995). Sirnilady, in a seven year long effort, Ye etal. (2000) recently 
set oftfaree genes for a short biosyiitfaetic pathway that resulted in PH:an>trae 63^^ In 
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comtrastyinostcUQn^Iastg^nesofliig^p^^ Bqnessionoi 
polycistions viatfae diloTOfdast genoiTO 

a sin^e transfimnatioii event We have lecentiy used die BaeUJvs 0tunn^ensis (Bt) aylAal 
openm as amodel system to demonstrate opewmo^Hession and ciystal torwaskm viafte chloroplast 

5 g^ome (De Cosa et aL 2001), CrylAsa is die distal gene of a three-gene operon. The arf 
immediately upstream of cryTAsQ. codes for a putatiye diaperonin that Militates the folding of 
ci>2Aa2 (and oAer proteins) to fonnproteolyticaDy stable 1998). 

Therefore, the crylh^ bacterial ppcran was expressed in tobacco chloroplasts to testtiie 
lesultant transgenic plants fi»r increased eaqsession and inqxioved persistence of the accnxmilated 

10 insecticidalprDtein(s). Stable forcigpgeQe integration was confirmed by PGR and 

anafysis in To and Titiansg^c plants. Cry2Aa2 operon derived pnotein acmrnnlated at 453% of 
the total soluble protein in mature leaves and remained stable even in old bleached leaves (46. 1%) as 
sbowninFig.1. Ibis is ifaehis^ level off<xdga gene esqpcession reported hi tr^ 
Bxceedmgly unccmtroUable msects (lO^lay oldcottonbollworm, beetaimy wcam) were killed 100% 

15 after consuming transgemc leaves. Electron micrographs showed the presence of toe msecticidal 
protdn folded into cuboidal crystals shnilar in shape to Cry2Aa2 crystals observed in Bacillus 
thuringiensis as shown in Fig. 2. In contrast to currently marketed transgenic plants with soluble 
CRY proteins, folded protoxin <»ystals are processed only by target insects that have alkaline gut 
pH. This approach improves safefyofBt transgenic plants. Absence of insecticida] proteins in 

2a transgEaaic pollen eliminates toxicity^ to non-targpt insects via pollen, in addition to these 
enviroamentaUy Meodly ^tproacbeSy this observation sen^ as amodel system for large-scale 
production of foreign proteins vidttnn diloroplasts in a folded configuration enhancing Ihmr stal»lity 
fflrf ftril^^g qwgtft jgteppmrlffcahW Thisistfaefirstdcmonstratianof CTqiressionof abactedal 
opeion in ttansgemc plants and opens the door to engineer novel pathways in plants in a single 

25 transformation event 

Expressing small peptides via the chloroplast genome: It is ccnnmon knowledge that the medicai 
community has been fightmg a vigorous batde against drug resistant pathogenic bactena for yeats* 
Cationic antibar^mni peptides finm mammals, amphibians and insects have gained more attenticHi . 
30 over the kst decade (Bbnoock and Lehrer, 1998). Key features of&esecationic peptides are a net 
positive chargD^ an afGnify for negatively-charged prokaryotio membrane phoq)holipids over 
neutral-charged cukaryotic membranes and the ability to form aggregates ^t disrupt the bacterial 
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miembrane ^iggiii and Saixscm, 1999). 

There are three major peptides^ Or^idk^ 

(g«nt aJkmolh), mag^inins frcm Jfewj/Jus lae^Hs (Afiacan firog) and defeaasins ftom Tn a mm alia n 

neutrophils. Magainin and its analogues have been studied as a broad-^tr^ 

5 systemic antibiotic; a woimd-healing stimulant; and an anticancer agent (Jacob and Zasloff, 1 994). 

We recently observed tot a synthetic lytic peptide (MSI-99, 22 amino acids) can be successfully 

expressedin tobacco chlaroplast(I>eGiay^al. 20^ 

tte pi^topafeogpoic bacteria Pseudomemas syringae and nmhidnig resi^ant fanman pathog^ 

10 a^to-hdix molecule tot has an afBmfy for nc^glively charged phoq^lqriite commonly found in 
tooatcr-«icmhraneofbactBria. Upcmcanlactwiflitosememlaancs,individdal 
tofonnporesin the m«anlMane,resaltmgin bacterial lys^ Because of to concaitrationdqwHlent 
action of to AMP, it was expressed via to diloroplast gmome to accomplish dose ddivery at 
topoinlofmfectian. products and Southern blots confirmed chloroplastintegratM^ 

15 foirign genes and hranoplasmy. Growth and development ofto transgenic plants was unaffected 
by hyper-expression of to AMP witoi chlort^lasts. In vitro assays widi To and Ti plants 
confirmed that to AMP was e^qiressed at high levels (2L5 to 43% ofto total sohibleprotdn)and 
retained biological activity against Aettttomojws syringacy a major plant patogen. M situ assays 
resulted in inteose areas of necrosis around to point of infection in control leaves, while 

20 transformed leaves showed no sigos of necrosis {200-800 jig of AMP at to ate of infecdwi) as 
sh0wninFi&3. Tiinritro essays B&aaxsstFsetid(^^ 

paiho^)diapkyeda96%uihibitionofgrowfliasshowninRs.4. These lesuhs give anew option 
intobatfleagainsli*vtppatfaogcoicanddrugHresistanthumanpa& Small peptides 

(Kke insulin) arc degraded in most organisms. However, stability of diis AMP in chloroplasts opens 
25 19 this oompartmCTt for ejqxression of hormones and other small peptides. 

Expression andassembly<rfmonoclonalslntransgenicchloropbists: Dental caries (cavities) is 
probably to most prevalent disease ofhiwnaiilrind. Colonization ofte^ by ^mtilanff is to single 
lno^i^^x]mntmlctot0r hi to devdc^eot of dental canes. SLmi^onsr is a non-motile, gram 
30 positive coccus. It colonizes tootiisuitoes and synthesizes ghicans (insoluble po^^^ 

friKtois fiom sucrose using to enzymes ghicosyltransferase and &uctosyltransferase respective 
(BotzetaL 1972) llieghicanspbyaniniportantrolebyanowing to bacterium to adhere to to 



wo 01/72959 



148 



PCr/USOl/06288 



1465-KT-a0(1577-P-00) - ^ 

smooth toollisiii&ces. The bacterium femente sucrose and prodaces lactic acidafiorits adherence. 

lactic add dissobro ttie minerals of the toodi, thei^^ 

A topical monoclcmal antibody dieirsqpy to prev^ adhemice of & mutans to teetti has 

recently been developed The incidence of carlogenic bacteria^n hmnans and animals) and dental 

5 caries (in animals) was dramaticallyiediicedforperiods ofnp to two years after the cessation of 
antibody therapy. No advrase events were detected either in the exposed anunals or in human 
volunteers (Ma et aL 1998). The ammal requiranent for this antibo<ty in the US alone may 
eventually exceed I metric ton. Therefore, thisantibodywasexpressedvia&BchlDroplastgenonie 
toa(^ev6hi^ierbvebofes}^essionandpn>perfolding(PancbaIetaL200^^ Hie integration of 

10 aiitibody genes into the chloropla^gpnoin© was confimied The 
ejqnession of bodi heavy and light diams was confinned by western blot an 
conditionsasshDwninHg^TAandB. Tteeaqpcessionoffnlly assembled antibody was coofi^^ 
by weston blot analysis mider mm-reductng conditions as shown in Fig. 7CL This is the first report 
of SQCcessM assembly of a nnilti-subanit human protein in tran^enic chloroplasts. Production of 

15 monoclonal antibodies at agcicultiiial level should reduce their cost and create new applicattons of 
monoclonal antibodies. 

Marker free chloroplast transgenic plants: Mosttransformationtechniqnes co-introduce a gene 
that confers antibiotic resistance^ along'with the gene of interest to impart a desired trait 

20 Rf gpTiftntrpg trnTtcfhmiMi flftlte m flntibifltic containing growth media pennits selection of only those 
cells that Iffiveincorporatedte foreign gpnea. Once transgenic plants are iegen«Bted, antibiotic 
T^gfem^ genftft serve no ttsefbl purpose but tfacv cotttinnc to produce their gene products. One 
among die primary cancems of gmtically modified (GM) oops is die preseooe of clinically 
impprt!!n» ^ntiKintSe fegngtmce gene piodocts in transgenic plants that could inactivate oral doses of 

25 the antibiotic (reviewed by Puchla 2000; DanieU 1999A). /dtematively, die antibiotic resistant 
genes could be transferred to pathogenic microbes in the gastrointestinal tract or soil rendering them 
resistant to treatment with such antibiotics. Antibiotic resistant bacteria are one of the major 
r^fliiftng^ of modem TPftdicmft. hi Gerriiany.GM crops coiitaining antibiotic resistant genes have 
been banned fiom release ^eerenboom 2000). 

30 CSiloroplast genetic engine^ing offers several advantages over nuclear transfimnatiQn 

inoliiHmg high levels of g^ esqyression and gene cootainmoit bnt utilizes thousands of copies of 
themostcommonlynsedaxrtibiotioresistBnoe genes. Engineodnggeneticallymodified (CM) crops 
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wifhcmt du: iise of antibiodc resistance g 

envHonment or gut microbes. Therefore, bctaine aldelqfde deh^ 

spinach is used herein as a selectable madcerCDam^ The selection process involves 

conversion of toxic betaine aldehyde (BA) by the chloioplast BADH enzyme to nontoxic glycine 

5 belaine, which also serves as an osmoprotectanL Chloroplast transfonnation efficiency was 25 fold 
hi^ier in BA selection than spectinomycin, in addition to rapid regeneration (Table 1). Transgenic 
shoots appeared within 12 days in 80% of leaf discs Oqp to 23 shoots per disc) in BAselectian 
conqpaicd to 45 days in 1 5% of discs (1 or 2 shoots p« disc) on spectmomycin selection as Aam in 
Fig.8. SoafliecnUotsoonfinn stable jntc^intt^ 

10 (-10,000 co|ries percefl) cesoWi^inhanKqplaaniy.Ttan^e^ ISU - 1816% 

higherBADHaG(h%atdS£&ientdeveilopmen1als^ TEan^enic 
p inntg ^yi^Tft mnipiti^-iftgiftflny indifiKngntQitiiKlft frnrn imtransfoimed plants arid the introdaced trait 
was staWymberited in the subsequent generation. This is the first rq)ort of genetic engineering of 
tiie chloroplast gpnome without the use of antibiotic selection. Use of genes that axe naturally 

15 pi^ent in spinach for selection, in addition to gene containment, should ease public concerns or 
perception of GM crops. Also, to should be very helpful in flie development of edible insuKn. 

E^iresaon of diolera toxm p sabnnit oligomers as a vacdne ui chloroplasts: CTB, vtten 
administered orally Qxbens and Hohngcen, 19H), is a potent nnicosal inununogen, which can 

20 ncatralizethetoxicityoftheCThok)toxinbyprevraitingit^ 

etaL 1998)i This is believed to be a lesiih of Kndmg to enkaryolic ccU snr&^ 
g;angKftgi'HftR , fficqittMrs present on the intestmal epitheKal surface, thus eKdting a mucosal inmiune 
ieq)onsetopatfiog*ng(rT«™*l^*^itl 1991) and mhanrangtheiriimuneresponsewhenchemical^ 
coiq)led to oOjer antigens ODertzhaufiJi and Elson, 1993; Hohngten et aL 1993; Nashar et al 1993; 

25 Sun etal. 1994). 

Cholera toxin (CTB) has previously been expressed in nuclear transgenic plants at levels of 
0.01 (leaves) to 0.3% (tubers) of the total sohible protein To increase expression levels^ we 
engineered Oe chlort^iast genoine to eoqiress the uninodified 
2000). We observed e3q[ttesaionofoli8omeric CTB at levels of4- 5% of total sohib^ 
30 as shown inFig.5A. PGR and SoutheniBlot analyses confirrnedstaMeintegpation of 

into flie cUoroplast genome. Western blot analysis showed that transgenic chlon^last expressed 
GIB was antigenically identical to conomercially available purified CTB antigen as shown in Fig. 6. 
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intestfaialnieinhraiiereceptorofch^ Tninsgenic tobacco planis were 

morphologically indistingnishsAlefi^ 

be stably inheritedmtheOTbseqoeotgeiieration^ 

Hie increased production of an efficient transmucosal carrier molecule and dtsHvery system, like 
CTB. in chloroplasts of plants makes plant based oral vaccines and fusion proteins with CIB 
needing oral adnmrisHation. a much more feasible wrnwch. .Ihese obsewalioiis estiiblish 
nncquivocally that chloroplasts arc capable of fonning disulfide bridgw to asseaable findgo 
piotrans, and ideal fiw raqpresaiMi of CIB fimon proteins. 



Polymer-proinsiiliDRecoiiibiiiaiitl^ Ctoepossibtejiisulincaiiressionsg^ 
iBdependent expression ofinsulin chains A andB, as ithasbeenproducedin£.«rfi forcommeicial 
potposeshithepasL B»e disadvantage oftinsmettuxi is AatRcoft does not fbnndisulfidehridgps 
in the ccnmlessfljcprotem is targeted to the periplasm. Bqxaisive in vitro asseoaibly after 
15 parificaticm is necessary for tins qjpioach. Therefore, a better 

ptoinsolin as a polymer fbsion protein. This mefliod is ideal because chloroplasts are capable of 
forming disulfide bridges. Using a smgle gene, as opposed to tiie individual chains, eliminates the 
need of condncting two parallel vector constiuction ptbceiBsea, as is the case fi 
chains. lhaddition,theiieedforindi»idnaIfermentationsandpurificationproceduresi^ 
20 by the angle genemethod. addition, pioinsulm requires less ptocessmgfoltowing extraction. 
Receafly, fte hmnan p»imfflisnlin gene was obtamed ftom Geaentech, 
prransulinwassub-d<MMriintoplK;i9 toferaTitatefiirlhe^ The nert stq> was to 

deagn pnnners to inakD dJotoplast eaqpression ve^ Since we are interested in pioinsulin 
esiprBSsion.lheS'primcrwasdesignedtobmd on the proinsulin sequence. This FWpri^ 
25 the 69 bases or 23 coded amino adds of the leader or pre-seqjience of prqiroinsd Also,tiie 
forward primer included flie enzymatic cleavage site for theprotease factor Xatoavoidthense of 
cyanogen bromide. Besides flie Xa-fector, a Smal site was hitioduced to faciliWe sobBequent 
snbdoning. The.order of the FW primer sequeace is Smal- Xa.fectot-Pt^^ The 
reverse prima mchided Bairiffl and Xbal sites, phis a short sequence with homok^ 
30 pUC19BBqueaioefollowijigthBpiohisolin^o&The29^ 

PCR2.1. AGVCfVP 50-mer was generated as describedprBvionsly CDameU et al. 1997) akH« with 
thcRBS sequence QAAGGAO. AmrthaSinalpartialdigBstioiiwaspetfoimedtoeliminatethe^ 
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codott of dw biopolynitt gene, decrease the 5(hw to a 4(hnw 

proinsolm sequence. Once the conect fcigmeot \m obtoined ty 

(etinimatiDg the stop codaa but indadiiig flie RBS site), h was ligated to tbe Xa^proinsulm fusion 
Sene resoWng in the constract pCR2.1-4(WCaPris. Finally, the biopolymer (40mer) B proinsulin 
fiision gene was subcloned into the chloro-plast vector pLD-QV or pSBL-QV and the orientation 
was checked in the final vector using suitable restriction sites. 

E^resrionaiidPurifiattonoftteBiopolymer^nrfiiSBMfiisknprotdnr XL-1 Bhestrainof 
B.cc4i containmgpLI>OC-XaPris andflie ne^jrtivc conttob, whidi inchided aplaamid contoming 
flttgrneinterevBrse (mentation andlheJloiBstrainwiflio^ 

Cai pellets w«e lesnsfiended in 500 |d of antocteved dH»D or 6M Gnaniane l^drochkrite 
phos5>halB buffer pH7i) were sonicated and centrifiiged at 4DC at 10,000 g for lOmin. Afier 
centrifbgation, the supenialants wem mixed with an equal voto 

HCa, PH 8, 100 mM NaO). Tubes were warmed at 420C for 25 min to induce biopolymer 

15 aggi^tinn.Thenaiefiisioopiotdnwasreccvcmdbyc«aitrifuging 

Sanq>les were mn in a 16.5% Tricme gel, transferred to tiie nitroceUulose membrane, and 
immunoblotting was performed. When the sonic extract is in 6M Guanidine Hydrochloride 
Phosphate Bu£fe,pH7.0,themole«Jlar weight changes from its original and correct 
ahigherMWofajiprTOriinatetySOMJaassihowninFigs. This is jHobabty due to ihe 

20 canftomatiai of dke biopolymer in flds buffer. 

The gd was fiQt stained with a3M CuCb and flien the same gfH was stained wiA 
Commasne R-250 Staining Srfution fer an hour and ftcn destained for 15 nrai first, and thm 
ovemi^ CuCb creates a negative stain (Lee et al. 1987). Polymer protons (without fusion) 
sppcat as clear bands against a bhie background in color or daric against a light semiopaque 

25 background as shown in Rg. 9A. Tlris stain was used because other protein stains such as 
Qwmassie Bhie R250 does not stain die polymer protein due to the Jack of aronmtic side 
(McPhersonetaL, 1992). Therefore, theob8ervationoflhe24kDaproteminR250stainedgdas 
shown in Fig. 9B is due to die msuHn fiision widi die polymer. TWs observation was fiirdier 
confirmed by proWngtiiese blots wifli die anliJiumanpioiiisulinairt^^ As antfcqialed, die 

30 polynierinsulinfiisionpiW«nwasohservedinwestOTblotsasriiDwninRgs.l^ 

proteinsobservedioHgs. lOA-CaretetnaneraadheHunercomplexesofproinsulin. Itiscvident 
diatdieinsulin^lyma-fiisionpn)tdnsBrBslablein£.co«. Confirming fliisobservBfion.recaidy 
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SS^^SSXt the PBP polymer protem cunjngates (ydthlhiaiedam and tendamistal) 
imdeigothennallyiBverabtepliaKtninsa^ 

(Meyer and adIkoti,1999)L TlieseresnhscleaitydeoraastiatelJiatiiisiito 
the inverse tempenitnre transit One of the concerns is the stability of 

5 insulinattemperatiiiesusedforAeimallyTeveisi^^^ Temperature inducedpioduction 

ofhnoianiiisulinhasbeaiincanmcraaluseCScln^ 1999> Also, &e temperature tiamdiion 
can be lowered by inraeasing the ionic ^rength of the solution doting purification of this PBP 
(McPheis<metaL l99Q.Tliu3.GVGVP-fusioncouldbensedtopuiifyaffliihit^ 
important iMoteins in asimple inexpeoaive 3tq>. 

10 

Biopolymer-proinsiilin foam gene eipresnon in cUwoplast: As described in section d, 
cWoroplast vector im bon*arded into ±e tobacco cUoroplast 

(Danidl. 1»7). PCSlandSoufliemBtotswBrepeifbnnedtDconfinnbiopi^nner^Hou^^ 
geneintegra&minlDcUcHpplastgenonie. Soiidiem blots show homoplasnQrm most To lines but a 
15 fbwshowedsomehrteioplasmyasshowninPig. 11. Western blotsshowlheejcprcssiooofpotymer 
pTTip.f,.«nfe.i»nrmtein in all transgenic lines in Fig. IOC QoantificatianisbyEIJSA. 

Protease Xa Digestion of the Biopolymer-prolnsulin fusion protein and Purification «f 
Proinsnlin: TheenzymaticcleavagBofthe&sionprotdntoreleasctiteproinsulmproteinfiwmd^ 
20 (ovGW)«wasinitiatedbyaddingaiefectorlOAproteasetofliepurifiedfiisionproteinatai^ 
(w/w) of qjiooxiniatBly 1-^. CSesvagB of the fiislon protein was nmnitoied by SD8-PA<ffi 
analysis. We detected cfcaved proinsnlin in the extracts isobted 

buffcras8howninK^lQA.andB. Conditicm8aicnofweingo|rtimi2»dfecanq?letecl^^ The 
XapiotBiB»hasbeensnccessMynsedpievion8lytocleaTO(GVGY%r^ 



30 



25 aL1992). 



Evaluation of chlorophut gene expression: A systematic approach to identify and overcome 

potential limitations of foreign gene expression in chloroplasts of transgenic plants is essentiaL 

Information gained herein nicreases die utiHty of diloroplast tran^»miation systm by scieatisls 

interested in cjpessing other forrignprotrans. Therefore^ 

transcription, RNA abundance, mJA stebiUly, rate erf protem 

folding aadbiologkadactivity.For example, the nite of tnmscriptianofthBi^^ 
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may be conqfflied with fte oqrossnig endog^ 

riOlA),ti8ingnm on lianscriptkm assays to drta^ leSiRNApioinotcar is operating as 
ejqwcted. Transgenic ddoroplastcootaj^ 

investigBtedtotestlheirtiaiisa^twaefBcie^ Similady.transgeneRNA. levels is monitored by 
5 notfliems,dotltotsandprimerCTtenaonTdativetoend^ 

results along with run on transcription assays should'provide valuable informaiion ofKNAstability, 

pn)cesan&etc. Wiftourpasteq)erieoceine]qjrcsdonofseveralfordgn^es,fi)^ 

appear to be extremely stable based OTmorthtan blot ana^ However, a ^^stematic study is 

valuable to advance utility of tins system hy ottier scientists. 
10 Important^, Ae efBriencyoftiandationm^betcdBdinisolaledcWoiciij!^ 

with tbe highly translated cWoroplast protein (psbA). Pulse chase eotpethneats help assess if 
transiational pousmg. prematme terminatioa occurs. Bvatoatioa of percent BNA loaded on 
polysomes or in oonstrncts with or without STJTRs helps deleimine Ite 

binding site and 5' shaiHlooptranslBtionalenhancBn: Codonqptiinizrdgpnes are also conq)aied 
15 withiinrnodifiedgeaestoim«stigateAerateoftramlation.pansingandteniii^^ Inouriecent 
e^ence,we(*s«veda200-folddi£ferenceinBccnnndationofforeignproteinsdnetodeci^ 
in proteolysis confiaied by a putative chaperonin (De Cosa et aL 200 1). Therefore, proteins ftom 
constructs ejqjressir^ or not caqaessing fte putative chaperonin (witii or without ORF 1 +2) provide 
valuable information on protein stability. Thus, all of thisinformationmaybeusedtoimprovethe 
20 next generation of diloioplasl vectors. 

OptimizaliiHKrfgHiecipres^H: Wehaveiq)ortedth8tfi)iagngaiesare.eaqra!S8edbetween3% 
(c»y24a2) and 46% (co'2/<a2 operon) ia transgenic chlonjpl^ 

2001). Sewaisppmad]Bsmay}»vseitoeabmBepamla^^ ^ 
25 chtoioplast8,tBnsct?>tional regulation as a botfle-neck in gene oqrossionha^ 

Btilizingthestrongconstituitivepromoterof the 16srRNA(Pim). OneadvantageofPrmisthatitis 
recognized by botii die chloroplast encoded RNA polymerase and the nuclear aacoded diloroplast 
RNA polymerase m tobacco (Allison etaL 1996). Several investigators havBUtDizedPnn in Aeir 
studies to overcfflne die initial hurdle of gene eq>ression,transcriptiott(PeQ)saetaL 200^^ 
30 aL 1999, Staub etaL 2000). RNA stabiHly appears to be «iearnongdieleastpr6blenis because of 
obseivation of excessive nonnnnlnttnn of farogn transcrqits, at times 16,96fr^fold higher dian the 
hi^eaqaessingnndeartiaBSgBnicplanls (Lee otaL 2000). Alsc^ other investigBSons regarding 
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KNA stabil^ in ddoroplasis suggest fliat rfforts fi» optumzmg gene axpressioo need to De 

addnassedattliBpost-lranscni»tioiiallevd(ei^ Onrwadcfocoseson 

addiessingproteinei^ressioniwst-tiaiiscii^fti^^ FOrexan^le,5=and3»XJrRsaieneededfiK 

opthnaltranslaticmandmRl«Ast*lffilyofchlo«^ Optimal ribosomal 

5 bindingsites(»BS==8)aswenasaslm4oopstmrtiireIocate^ 

efficient tiandation. Arecent stu^ has shown that replacement of ttie Shinfr-Delgamo (GGAGG) 
witti the pshA 5= UTR downstream of flie 16S litNA promoter enhanced translation of a foidgn 
gene (CfOS) hundred-fold (EM et aL 1999). Therefore, the 20W»p tobacco chloroplast DNA 
fragment (1680-1480) containing 5= pabA UTR may be used. This PGR product is inserted 

10 dowiistreamoftiieieSiitt^ApromolBCto enhance transhtionctf 

Yetanoflier^)proadi£breiihancemcDtofttanslalionistooptinrizecodMico^ ^® 
have campanriA4^I% content of anftmgpggieslhathadbeeag^^ 
wiflithepeicentagpafdiloro^astei^resfflOD. WefomidftathigJierlBvdsofA+TalwayscQndalBd 
wahhi^ expression levels (sec TsiUb 2). It is also potentially possible to modify cUoroplast 

15 prolease recognition sites ^rtnle modifying codons,wTftontaffix^ 

Thraefoie, optimizing codon conqwsitions of insulin and polymer gaies to matdi the psbA gpne 
diould enhance flie level of translatitm. Although rbcL (RuBisCO) is the most abundant protein on 
earfli, it is nrt translated as highly as the psb A pne due to flie extremely high turnover of the psbA 
geneiaoducL The psbA gene is under stronger selection for increasedtranslatian efficiency and is 

20 tiie most abundant IhylakDidproteni. hi addition, flie codon usage in higher plant chlon^ 

biased towards IheNNC codon of 2-felddegBBenitegioi)^ C»^. TTCoverm^ OACover OAT. 
CAC over CAT, AAC over AAT.ATC over ATT. ATA etc.). Thbisinaddiliootoaatrongbias 
tow«idsTat&ethiidpo8i1ionof4-fi«dc«t!iw8tegroui)8. There is also acontcsctcEfcctaiatshoiird 
be .taken into consideration wtile modifying specific codons. The 2-fold di^erate sites 

25 iimnediat«^i?iBtream from a (aWcoAm do not show this bias towards NNC. (TITGGAis 
preferred toTTCGGA^e TIC CGTisprefered to TTTCGT.TTCAGTtoriTAOT 
TCP to TTT TCT, Morton. 1993; Morton and Bemadette, 2000). hi addition, hi^y expressed 
chloroplast genes use GNN more fireqpeally fliat otiier genes. The disdosum of site 
http7A»wwJcazusa.or.jp^codon and ht^)://wwwjiclrijihnjrih.gov may be used to qptfanizB codon 

30 con?K>sition by comparing codon usage of d«aentpUmtspede9= genomes and PshA-flgene& 
Alnmi}i"«> «»mift ncid-q in chloroplasts and tRNAanticodonspresentin chloroplast may betaken 
into consideration. OptmrizaticmofpolymerandpnMnsulminBybeper&imedusinganovelPCR 
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appmach (ftodrcmm ami Peari, 19««; Caidnri^ 
our bbraatary to optimize codim ccm^wshifm rf^^ 

VeetoroHistnicttons: pLDvwto is wedfiHraU the constructs. This wctnr was devdopedfijr 
5 chloioplasttransftttmati<m. ltcoiitainsthftl6SiRNApiomoter^)drivingtbese!ectablemaiker 
gBoeaodMCamiiioglyGosideadenylliansfisnw 

the omltqtle domng site and then flie 3* region (the tenninator fiom a gpne coding for 
photosystmnreactioiicentercoiiipottenls)ftomthetob3ccocmoropla8tge^ UiBpU) vector 
is a universal chloioplast raqaression /integcatiMi vector and can be used to transfoan diloroplast 

10 @eiKmesofseverdotherplairt8pedes(I)ameUetal.l998,DaiiielllW^ 

scquencesaiehighlyconservedaiiionghi^plants. TheimivOTalyBctoriisesrr«4aiid<ntfgexies 
(cUoroplasttransfcrRNAs codingfer Alanine andlM!enrane)fiamteiiivei^ 
tobacco chloioplast genome as flmlring sequences fig homologous lecombinatiwi. Becausethc 
univeisal vectnr integrates fbreigB genes wiflim the Inverted Repeat region of the cUoroptost 

15 pnome,itshDuMdoublethBcoivmmiberofaietiansgene(ftom 

tobBCCo> FnrttiBrmorB, it has been demonstiated that homoplasmy is achieved even in the first 
round of sdection m tobacco probaMy because of flie presence of a chloioplast origin of repHcation 
within flie flankmg sequence m the bniversal vector (thereby providing more temphites for 
integration). These, and several other reasons, foieiga gene expressioii was shovm to be mudi 

20 Mgher^rfientheuniversalvectorwasnsedmsteadofflietobacco^edficv^ 

CIB-PndnsnfinVeetorCoiiBtnictiMi: ThBehloroplaste)q?ie8aionvectorpII>-CIB*oin8inay 
- becaistinctedasfbllowB. FifB^bo&pnrinsolinanddioterat^ 

fiomsuitableDNAuangprmiersequaices. Primer IcontainstteGGAGGchloro^ 

25 nW>mebnidmgsitefivenncleotides»qpstrBamofthestartcodon(ATG)fortheCm 

suitable restriction enzyme mte (SpeO far insertion into Ae chloroplast vector. Primer 2 eliminates 
the stt^ codon and adds the first two amino acids of a flcidblc hinge tctrapq?tidc GPGP as reported 
byBcrgcrotetaL(199UtofaciKtatcfi)ldingof4eCnBi»roinsulinfu^ PrimcrSadds 
tiB remaining two amino acids for fte hinge tetra^iqrtide and elimi^ 

30 native pre-pioinsulin. Primer 4 adds a smtable restriction site (Spel) fijr subcloiiing into the 
chfciq)lastvector. AnqiKfiedPOl products may be mserted into the TAclonmg vector. BofeAe 
CIB and pronisulin PGR fiagmenis maybe esuasedattheSmal and Xbal restriction si^^ Ehited 
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fiag^nentsiireliertedmtolteTAclonuigve^ IteCTB^msulinfiaginBiitfliaybeexci^ 
tiieEcoRIates and inserted into EcoM digested dqjho^^ 
The foltowring VBrtois may be designed to <?rt^^ 
piodm^ of proteins wiflifte same amim add 



10 



19 



a) Using tobacco planis, Eibl (1999) demonstrated, in vxw. tbe differences in translation 
effici£«<^andmKNAst*i%ofaGUS reporter gene diietovarious 5= and 3=ra 
regions CUTR==s). Tins already desmTxri systematic transcription 
canbeusedinapraeticdendeavorof insulinprodndiii^ ConsistentwithEM=8(l999)data 
fainoeasedtraiBaatioiiefiBciaicyandinl^ 

addition with fljepshAS-OTRabeBdy in use. Hie 200 bp tobacco chlproplast DNA 
fiagment containing 5= psbA UTR may be amplified by PCR using tobacco cUoroplast 
DNA as ten?>late. This ftagmcmt may be cloned directly in the pUD vector nmltiplB cloning 
sitedownstreamofdiepromoterandlheaadAgBne. Ihc cloned sequence may be exactly 
the same as in the psbA gene. 

b) Another ^iproach of protein production fa diloroplasts involves poteotial insoBn 
oystallizatioa for fedlilating purification. The co'2Aa2 BadOm thurlngumsis apeam 
derivedpatativedi^peromnmay boused. E:q)resdonoffteoy2<o2opcjoninchlon)plasts 
provides a modd system for hypcr-ejq»ies8ion of foreign proteins (46% of total sohible 
protean) inafolded confignnrtion enhancing their stability and fius^ 
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Cosa 6t aL 2001). Thisjiistifiesincli^onofthBpDtatovBC^^ 

q)enm in one of the newly designed co^ In Aisie^ there are two open reading 
ficames (Om and ORF2) and a ribosomd binding si^ Hiis sequence contains 
dements necessary for Oy2i4a2 crystallization, 
5 subsequent purification. Successful crystallization of other proteins using this putative 

cbapcroninhas been demonstrated (GcetaL 1998). 'nieORFlandORF2oftbeBtCry2Ax2 
(^)eronmaybeaiiq»MedbyPaiusingti[iecon^leteoperonasaten^^ Subsequent 
cloning, using a novel PGR technique, allows for direct fusion of this sequence immediately 
upstreamof die proinsulin fusion protein wiflwot altering thenucl^^ 
10 nCTmaUynecessaiy to provide a restrictkmen^rme site (HOTtonetaL 1988). 

c) To address codon drtimizationlhc proinsuKngpnenKQr be subjected to certainmodificatiDns 
insobseqaeaitcoostnicts. TlieplastidnK)difiedpnMnsulin(PtPri^ 
sequraice modified such that the codons are optfanized for plastid expression, yet its amino 
add sequence remains identical to human proinsulih. PtPris is an ideal substitute for human 
15 proinsulininlhcCTBfusionpeptide. Thecjq>resaonofthi^ 

the native human promsulin to determine the affects to codon optimization, which serve to 
address one relevant mechanistic parameter of translation. Analysis of human proinsuKn 
gene showed that 48ofits 87 codons were the lowest fi»quency codons in the cWorop^ 
the amino acid for which diey encode. For example, there are six different codons for 
20 leucine. Thwr fiequency within tiie chtaoplast genome ranges ftom 73 to 30.8 per 

thousand codona. There are 12 leucmes in promsuKn, 8 have the lowest fiequoi^ 
(73X and none code fi» to Wghestfieqiica^ codons (3^^^ In the plastid. optimized 
pcdnsalin ^jaeall the codons code for fte mostfiequcnt^ whereas inhnmanptoinsuKnover 
halfoffte codons are die least ftequent Human proinsuhn nucleotide sequence contains 
25 62% C-Kj, whereas plastid optimized proinsulin gene contain 24% C+G. G^ierally, lower 

C+G content of foreign genes correlates with higher levels of raqnession (Table 2). 
d) Another vision of the proinsulin gene, mini-proinsulin (Mpris), may also have its codons 
c^rtimized for plastid expression, and its amino add se^^ 
proinsulin (Pris). Pns= sequence is B Cham-RR-C CSiain-KX-A 
30 sequenceisBCaiaiiHKR-ACaiain. TheMPrissequenceexchidestheRR-CChain.whichis 

ncmnally excised in proinsulin niaturatiOT to insulin. The C chain of proinsuBn is an 
unnecessary part of w vitro production of msulin. Pioinsulin folds propcriy and forms the 
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apiHopriatedisulfiabbiHidsmfliBabsc^ The rcmainingKR motif that 

exists between the B chain axid Ibe A cb^ 

upon cleavage with ttypda and cartxixypepti This construct may be used for our 

bic^olymer fosion protein. Bfs codon cptiiiDd2atidn and amino add sequaM» is ideal for 
5 matDie insulin piodoction. 

e) Ourcoirenthumanproinsulin^ttopoljraerfusionprot^ 
site^wHchservesasacleavagj&pointbetweendiebiopolynierand Currently, 
cleavage of the potymer^insulin fusion protein with the factorXahas been inefficient in 
ourfaands. Therefore, we replace this cut site with a trypan cot ai^ This elinihiatBS the 

10 need for the expensive festorXa in processing proinsuli^ Since pioinsnHn is canrently 

processed by tiypdn in the foanatiGn of mature msnlin, insnHn matnratiQn and fusion 
pqrtidc deav^ge can be achicvrf in a singje stq) whh tiypM 

f) We diseTvedinconqrfdetiansMonpioduirtsinplastids when we e^ 
(OttdaetaL2000). Therefore, white eoqffessing^polymer-piomsulinfb^ 

1 5 decreased tije lengdi of the polymer protean to 40mer, withcmt losing die thennal responsive 

proper^. In addition, c^tnnalcodons for glycine (GGTI) and valine (GTAXw^ 
80% of the total amino adds of the polymer, have been used In all nuclear encoded genes, 
glycine makesiq) 147/1000 ainino adds while in tobacco chloroplasts it is 129/1000. Hi^ 
expressing genes like psbA and rbcL of tobacco make up 192 and 190 gly/1000. Hierefore, 

20 glycme may not be a limiting fector. Nuclear genes use 52/1000 proline as opposed to 

42/1000 in chkHcplasts. However, currently used codonf0rpr61ine(CXXOc^ 
to CCA or CXJr to ftrflmeaihance translation- It is known that pathways for proline and 
valmc are compartmentalized in diIorophsts(Guda^ Also, proline is known to 

fK*^|iVffi1iitft m chknmplasts as an osmoprotcctant (Danicll ct al. 1994). 

25 g) Codon con:iparison of the era ^e vw&psbA, showed 47% homolp^ 

Sequent codonsofthepsbA gene. Codon analysis showed that 34% ofthecodons of CTB 
are complimentaiy to tiie tRNA population in the chloroplasts in comparison with 51% of 
pebA codoDS tfiflt aTT> (wptin^ftntflTy tr\ the chloToplast tRNA popalatian. Becauseofthe 
high levels of CTB e^qnession in transgenic chkxroplasts. (Henriques and Danidl, 2000% 

30 there will be no need to modify die CTB gene. 

DNA sequence of all constructs may be determined to confirm the correct (mentation of 
g^es, in fiame fosion, and accurate seqoences in the recomhmant DNA constructs. DNA 
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ABIPrismDyeTCTninationC^leSeqiien^ InsertionsitesatbodiendsinaybeseqD^^ 
iijging primers for each stramL 

Bsqnessioii of all chloxoplast vectocs aie first tested in E. coli before Iheir use in tobacco 
5 tnmsfonnation because of tbe similarity of protein synthetic machinery (Biixley et al. 1997). For 
Escherichia coli expses^ia^l'^^ JE:.a>/i be transfonned by a standard 

CaCb method. 

Bombardment and Reg^imticm of Chloroplast Ttansg^c Plants: Tobacco (fficatiana 
10 tabacum var. Petit Havana) and nicotine fiee edible tobacco (LAMD 605, gift fiom Dr» Keitti 
Wycofi^ Planet Biotechnology) plants may be grown asq)tically by gennination of seeds on MSO 
mcdimnCDaniell 1993). Foltycaqpanded,daricgrecnlcavcsof abouttwomonthoMpl^ 

used for bombardment 

Leaves maybe i^acedabaxialsideiq) on a WhatmanNo. 1 filter paper laying on the RMOP 
15 niediiim(DanielU 1993) m standard petri plates (100 x^^ Gold^.6pm) 
microprojectiles may be coated wi&plasmidDNA (chloroplast vectm 

out with Ae biolistic device PDSIOOO/He (Bio-Rad) as described by Daniell (1997> FoUowing 
bombardment, petii plates are sealed with parafilm and incubated at 24 □ C under 12 h photoperiod. 
Two days after bombardment, leaves are chopped into small pieces of -5 mm^msizeandplacedon 
20 fee selection medium (RMOT cootaming 5 00 jigtal of spectinomydn dihydiochloride) with abaxial 
side touching tte medium in deqp (100 x25 mm)petriplates(--10pieo^ 
sijcctinoraydnresislantshootearedioppcdi^ 

petrii^ates(~5 pieces pcrplate) containing ttie same selection medi^ Resistant Aootsfiom the 
secondcohurecyclearetransfmedtoflierootinginediumC^ 
25 mg^andspectinon^cmdihydrodiloride, SOOmg/liter). Rootedplanls maybe tiansfisaedtosofl 
and grown at 26DC under cootmuous lifting conditions for furtiier analysis. 

Polymerase Chain Reaction: PC31 may be performed using DNA. isolated ftim co^ 
transgBoic plants to distinguish a) true chloicplast transfoimants firom mutants and b) chloroplast 
30 transformantsftomnucleartransfonnants. Pthnerstetestmg the presence of the aadA.gpnc(fliat 
confers spectmomydn resistance) in transgenic plants may be landed on &e aadA codmg sequence 
and 16Si»NAgMie (primers IP&IM.). To test chloroplast mtegration of the insulin gene, one 
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primer lands on the aadA gene wliife anottier lands cm the native chloiqplast gsaaawe (pdmers 

3F&3M)l No pgr product is obtained wiAmcleartransgenic plants The 

primff set (2P & 21i^ may beused to test integpitim of the entile 

deletian or looting ont daring homologpns reconibination, by landmg on ibe lespective 
5 recombination sites, • A similar strategy has been used successfully by us to confirm chloroplast 
integration of foreign genes panieU et al, 1998; Kota et al., 1999; Guda ct al., 2000). This 
screening is essential to eliminate mnitants and nuclear transformants. Total DNA ftom 
unbombarded and tran^qmc plants nwy be isolated as describedby Edwards 
PCR analyses in transgenic plants. OdoroplasttransgBmcplantsoontainmgtheproinsulingcne^^ 
10 then be moved to second xound of selection to achieve homi^lasniy. 

Sovthmi Blot Anaj^ysb* Sondiem blots axe pedBaimed to drt^enntnft the copy number of the 
intEodocedfareigngpnepercellaswellastDtesthDmopIasmy. ThmaxeseverallfaousandcGidesof 
the chloroplast genome present in each plant ceU. Thoefore^ when foreign genes arc mserted into 

15 the chloroplast genome, it is possible tiiat some of tiie chloroplast gpMmes have fordgq genes 
integrated while others remain as the wild type (hetert^lasmy> Therefore, to ensure that only the 
transformed genome exists in cells of transgenic plants (homoplasmy), the selection process is 
continued. To confirm^t the wild type gpoome does not exist at the end of the selection cycle, 
totalDNA ftom transgenic plants sh<Hild be probed wifli the cWorc^lastba^ sequences 

20 (the tinI-4inA ftagment as shown in Figs. 2A and 3B. If wild type genomes are present 
Qi^eioplafflnyX the native fagment size is observed along with transformed genomes, lliepresence 
of a large fiagment (due to insertioa of feidgn gisnes wi 

the native small fii^eatit confirms homoplasmy (Daniell et aL, 1 998 ; Kota et aL, 1999; Qadsi et aL, 
2000). 

25 The copy number of the integrated gene is det^anined by establishing homc^lasmy for the 

transgenic chloroplast genwne. Tobacco CWoroplasts contain 5000-1 0,000 copies of iheirgenome 
per ceil (Daniell et al. 1998). If only a fraction of the genomes are actually transformed, the copy 
mmiber,bydefeult, must be less than 10,000. By establishing that in the trans^cs the msulin 
inserted transformed genome is ttie only one present, one can establish that the copy nmnber is 

30 5000-10,000 per celL This is usuaUydcme by digesting the total DNA wiA a suitable restriction 
enzyme and probir^ with the flanking sequences Hast enable homologous recombmation mto the 
chksoplast ^nome; The native ftagment presexit in die conto 
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The absence of native fiaginent psoves fliat odly Ixan^ 

cell and there is no native, untiaiisfcxmed^ chlaroplast genome; without the insulin gene pieseot 
Thi5f^blT>h'^tftfthfnnnp1ajCTm frirniiltaDeaustyTHOvidmgaaestimate 
of 500(^^10,000 copies of the finrdgn genes per ceU. 

5 

Northern Blot Analysis: NorthcmblotsmaybcperforniedtotesttheefficiencyoftiansCT^ 
tiiBproinsulin gene fused wiACnBwpoljTnergra Total RNA is isolated fixim 150 mg of fiozai 
leaves bymingtiie*Ttoeasy Plant Total RNAlsolaiionKi^ KNA 
(10-40 |ig) is denatured by fimnaldehyde tieatment, separated ona I ^% agarose gel infhepieseooe 
10 offomialdehydeandtnoisfaiedtoanitroceMosenieEn^^ 

(1989). Probe DNA (pEioinsalin gene coding r^on) niay be ]d>e]ed by therandoin^iDinedittediod 
OProme^) with ^-dCTP isotope; The blot is thea pre4iyhiidi2»d, lq*ri^ 
described above fiffsoothmi blot analysts. Transcript levels may be quantified by IheMoleciilar 
Analyst Program usmg the G&-700 Imaging Densitometer CBio-Rad, Hercules^ CA) or &e like. 

15 

P<riymer^iillB fosion protdn pnrificatioii, quantitation and charactmzation: Because 
polymer insulin fusion proteins exhibit inverse temperature transiti on properties as shown in Figs. 9 
and 10, they inay be purified fomtransgenicplants essentially foUowing the same 
for polymer purification firom transgenic tobacco plants ^hang et al.,1996). Polymer extraction 

20 bufEer contains 50 mMTris-HO, pH, 7,5, 1%2-niecqitoethanol, 5mM EDTAand 2mM PMSFand 
0.8M:NaCL Ihehomogeimteislhaicenlnfugedat 10,000 gfo 10 minutes 
discarded Thesiycinatantisincnbatedat42DCfor30rnmutcsandth^centra^ 
for3minDt63atS,000g(room temperature). Ifinsulin is found to be sensitive to tins ten^exatur^ 
Tt is lowered by inoea^g salt ccmcenlratiQn (McPherson et aL, 1996). The pdlet containing fiie 

25 insulin-polymer fusion proteni is resoq[)endcd in the extraction b^ 

minutes. The mixture is ccntrifiiged at 12,000 g for 10 niinirtes (4DC). The s\4)ematant is then 
collected and stored at -20OC. The purified polymer insulin fusion-piotein is electrophoresed in a 
SDS-PAGE gel according to LaemmH (1970) and visualized by either stainmg with 0.3 M CuQa 
(Lee^aL 19S7) or transfeoed to nitrocellulose membrane andpnibedwiflianti 

30 thepolymerarinsuUnproteinasdescribedbelow. Quantification of purifiedpotymerprotemsmay 
be carried out by EUSA in addition to densitometry. 

After electicfdaoiesis, proteins may be transferred to a nilroceUulose mCThtane 
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eIarfnipharcdcaltym25mMTiis,l«iiiM^^^ The filte is Wodced 

with 2% dry milk in Tds-buffiied saline for two hows at room tenperatiire and stained with 
antisenim taised against the polymer AVGVP (kindly provided by Ihe Univereity of Alabama at 
^irmwigfiam fnrmnpJnnal famHty)nvemiflfctin2%drymillDrrrisfaiffer^ The protein bands 

5 reacting to Ae antibodies are visualized using alkaline phosphatase-linked secondary antibody and 
the substrates nitiobhie tetrazolium and 5-brDino-4-chiaro-3-mdolyl-pho^hat6 (Bio-Rad). 
AltematiYely, for insulin-polymer fusion iHT>teins» a Mouse anti-human proinsulin (IgOl) 
monocknal antibody may be used as a inrimary antibody. To detect the bmdii^ of the pnmaiy 
antibody to the recombinant proinsulin, a Goat anti-mouse IgO Horseradish Peroxidase Labeled 

10 monodoiial antibody CBPR)rnay be used The substrate to be iised for ccH^ 

be3^,5^Tetramed]^benzidine. Ptoductsm^bepurdiased fiom American QualexAntibodies 
in San Clemente;^ CA As a positive control, hmnaniecombinaiit proinsulin fiom S^^ 
used. .ThishumaniecDinbinantproiiisulin wase3qHessedin£Lco£fbyasyn&eti 
Quantification of purified polymer fusion proteins may be carried out by densitometry using 

15 Scanning Analysis software CBioSoft. Ferguson, MO). Tckal protein contents may be detciminedby 
the dye-bindmg assay nsir^ xeagaits supplied in kit fiom Bio-Ra4 with bovine serum albumin as a 
standard. 

Characterization of CTBe:qpres$ion: ClBprotemlevelsintransgenicplantcrudeextractcanbe 
20 determmed using quantitative ELISA ass^ A standard curve may be generated nsmg known 
concentrations of bacterial CTB. A96-weUmicTOtit^plateloaded with 100 pl/well of bacterial 
C3B (conceiitratiQQS in the rai^ of 10 -1000 r^ is incubated ovCTi^ Ibe plate btten 

washeddiricewifliPBST(phosidudebufibKd8afi^ ThebadcgrouiKi 
may be blocked mcubation m 1% bovine seium albumin (BSA) in PBS (300 |d/well) at 3700 for 
25 2 h followed by washing 3 times widiPBST. The plate may be incubated in a 1:8,000 dihition of 
rabbit anti-cholera toxin antibody (Sigma 03062) (100 Dl/well) for 2 h at 37DC, followed by 
washing the wdls three times with PBST. The plate may be incubated with a 1:80,000 dihition of 
anii-rabbit IgG amjugated withalkalinephoshatase (100 Dl/well) for 2 h at 37DC and washedtinice 
withPBST. Then, 100 □ I alkaline phoq)hatase substrate (Sigina Fast p-nitrop^ 
30 in 5 ml of water is added and diereaction stopped with IMNaOH (50 Dl/well) vAm abs<vbanctes 
in the rnid-rangeofdie titration reach about 2.0, or aflw 1 hour, vrtricheverw Theplateis 
tiien be read at 40Snm. These results are used to generate a standard curve &om which 
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ponccntraSoiisofplant protein are eadi^lated. Thus, total sobble plant protein (conceiitr^ 
previously detendned using tilie Bradfcnd asssry) in bicaxbonate buffer, pQ 9.6 (15mM N^iQ>3, 
35mM NaHCQj) may be loaded at 100 plant DlAvell and the same procedure as above can be 
repeated TheabsoibancevaluescanbeiisedtodeteamunetfaenitioofC^ 
5 plant piotein, using tbe standazd curve generated prcvionsty and the Bradford assay results. 

Inheritance of Introduced Foreign G«ies: While it is unlikely tfiat introduced DNAmove fiom 
the chloroplast genome to nuclear genome, it is possible that the gene can be ffltegrated in tiie 
TTTT^.tffT gftPA^nft A mng hnmhardment and remain undetected in Sonthem analysis, Theiefor^in 

10 initial tobacco ttansfonnants^ s(»ne are allowed to self-pollinate^ whereas o&m ase used in 
rec^irocal crosses widtcontiDl tobacco (transgenics as fianale accqpters andpoUen donors; testing 
formatexnal inheritance). Harvested seeds (H) are g p nninnt Fi d on media r ontaining ^^ecfeu 
Achievemoit ofbomoplasmy and mode of inheritance can be classified by looking at germinatian 
results. Hcmoplasniy is indir?*^ by totally green seedlings (Daniell et aL, 1998) while 

15 hetBtoplasmyisdisplayedbyvariegatedleavesOto«A^ofpigi^^ Lack 
of variation in chloroph^ pigmentation among progeny also underscores the absence of position 
effect, an artifect of nuclear transfonnatioa Maternal inhKitance is demonstrated by sole 
transmiffiion of mtioduced genes via seed generated on transgenic plants, regardless of pollen source 
(green seedlings on selective media). When transgenic pollen is used for pollination of control 

20 plants^iesultantprogBoydonotcontainresistancetocheniicalinsete^ 

Svab and Maligo, 1993). M61ecolaranalysescancontotcansmissionande^HBSffl<Miofintr^^ 
genes, and T2 seed is gcacrated fixrai those confirmed plants by the analyses desc^ above. 

Comparison of CnnraitParification with Polymer-basedPorifi^ Itisinqxniant 
25 to compare purification m^hods by testing yield and |Hirity of insulin produced m E.coli and 
idbacco. Three mirthods may be coirq)ared: a standard fusion protein in £lco/^, polymer proinsuhn 
fusion iHt)tein in Kcoli, and polymer proinsulin fusion in tobacco. Polymer proinsohn fusion 
peptide fiom transgenic tobacco may be purified bymediodology described in section c) andDaniesIl 
(1997). jSlco/Zpurificatiaii is pafonned as follows. One KterofcachpLD containing bacteria is 
30 grown mLB/anqncillin (100 Qg/hil) overnight andthe fusion protrin, eitherpolymer-proinsulinor 
tiie control fosion protein (Cowley arid Madkm 1997), ej^ressed. Cells are harvested by 
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centrifug^tioa at 5000 X g for 10 min at 4DC; an^ 

wt Bacteria) of 100 0iMTx]s-HC3,pH 73. LysozymeisaddedataconoentratiQaof ImgAiiland 
placed on a lotatmg shaker at room teaaperatme for 15 xnin. The lysate is subjected to probe 
sonkationfortwo cycles of 30 soD/30sQffat4OC Cdhilar debris is removedbycentrifagati^ 

5 1000 X g for 5 min at 40C. The Kcoli prodiiced proinsulin polymer fusion protein is purified by 
inveise temperature trandtion properties (Daniell et al., 1997)- After Factor Xa cleavage (as 
described in section c)) tbe proinsulin is isolated from the polymer nsing inverse traiQ>eratQr6 
transition properties (Daneill ^ aL, 1997) and subject to oxidative sulfitoiysis as described below. 
Alternatively, tiie control foaian protdn is purified according to Cowley and Maddn (1997) as 

10 follows. lhesiq)cinafantisiclahffidandceatrifug^ 

tbe inclusion bodies. Ihe supeniatant is Ihen discarded and die pellet zesnspeod^ 
(original wt Bacteria) of dHiD^ ahquoted into imcn>ceiitiifage tubes as 1 ml fiactians* anddien 
ccntrifngedatl6000Xgfor5iiiinat4DC- Tbe pellets are individually washed with 1 ml of 100 
mMTVisrHCa, pH 8.5, IMuiea, 1-1 THtimX-lOO and again wadicdwittil0OmMTrisHapH8.5,2 

15 Miirea,2%TrintonX-100. The pellets are then reaiq>emled in 1 ml ofdH;^) and transferred to a 
pie-weighed30mlCorexcentzifb^tube. Hiesan^leiscaitrifngaiat 15000 Xgfor5minat4DC 
and the pellet resu^)ended in 10 ml/g (wet wt pellet) of 70% formic acid. Cyanogen bromide is 
added to afinal conceotration of 400 mM andtiie sample incubated at room temperature in thedarit 
£3rl6h. Tlie reaction is stopped by transfening the san^le to a roimd bottom flask and re 

20 solvent by rotary evaporation at SODOC. The re^dueisresuspendedin20 ml/g (wetwt pellet) 
ofdHaO^shellfiozeninadiyice^ianolbatfa^andlhrailyopbilizBd Urn tyophDizBd protein is 
d]ssolvedin20myg(wetwtpeUet)of500mMTris-HCLpH82>7Mnrea. Oxidatiye sulfitoiysis 
may be perfimned by adding sodmm sulfite and s(^^ 

and 10mM» respectively, and incubating at room temi^^ This reaction is stopped lyy 

25 fieezingondtyice. 

PnriflcatiDn and folding rfHumanProinsnlin: The 5^sulfonated material may be applied to a 2 
ml bed of Sephadcx G-25 equiKhrated m 20 mM Tris-HCl, pH 8 2, 7 M urea, and tiien washed with 
9 vols of 7 M urea. The collected fiacticn is applied to a Phaxmada Mono Q HR 5/5 cohmm 
30 eqoih1iiatedin20nd^TnsHCl,pH8.2,7Mureaataflowrat^ Almeargradient 
leading to fbwIconcCTtration of 0.5 MNaQ is nsed to ehite the bound materia 2min(2ml) 
finctions axe collected during the gradient, and protein conc^oArarion in each fiaction determined 



wo 01/72959 PCTAJSOl/06288 

t6S 



1465-PCT-00(1577-P-00) - « 

. Punty and molecular oiass of £ractioDS is e^imatedby Tdcine SDS-PAGE (as shown m Fig. 2), 

whmTridiie is iised as the trailing ioa to allow betteriesohitian of pqrtidesin theiange of 1 - 1000 

kDa. Appropriate ftacdons are pooled and to 

(sq)erfine)eqoilibiatBdin5niMainmonto The sample is collectedbased on UV 

5 abscnfoanceandfieeze^ed. The partially purified 5^sulfcmted material is 

mM glycine/NaOH, pH 1 0 3 at a final ccmccniration of 2 mg/ml p^merc^toetfaanol is added at a 
ratio of 1.5 molpermol of cysteine .J-snlfonate and the sample stirred at 4QC m an open container 
fi)rl6h. The sample is tiien analyzed by re vttsed^hasehi^-performanceEqm 
(EIP-HPLC) using a Vydac C* column (22 X 1 50 mm) equilihmted in 4% aceftooitrile and 0. 1% 
10 TFA. AdsQibedpq»tidesareehEtedwidialineargr8dientofinfa:easingac^^ 

(0.88%pcrmmnptoama3dnramof48%). Tlie remaining refolded proinsulm is ccntiifiigpd at 

leOOOXgto remove insohibleniaterial, and loadedonloascm^^ 

250nim). The boimd material is then ehited as described above, and the pioinsu^ 

lyophilized. 

15 

Analysis and charaeterliatlon of insnfin expressed in ReoU and Tobacco: The purified 
expressed proinsulin is subjected to matrix-assisted laser desorption/ionization-time of flight 
^ilAIJ)I-TOI9 analysis (as described by Cowl^andMacJdn, 1997), using proinsulin from Eli lilly 
as both an internal and extanal standard. To detemiine if the disulfide bridges have formed 
20 conectly naturally mside ddaniplasts or by tn vitro processings aproteolytic digestion id peifonned 

proinsuKn are lyophilized and lesuspcaMledm PtoteaseVSis 
addodataratbof l:S0(wAv)inG3q9erim6ntal8amplesandno^^ AH 
samples are Ami incubated ovemifijit at 37DC; 4e fcacdons stopp^ 
25 ggmplfts stored at -2QDOC until analyzed ThesamplesareanalyzedbyRP-HPIX^uangaVyiacQ 
cohamn {22 X 150 mm) eqnihbrated m 4% acetonitrile and 0.1% TFA, Bound material is then 
eluted usmg a linear gradient of increasing acetonitrile concentratian (0.88% per min up to a 
msxinnmi of 48%). 



30 CTB-GMl gangUodde binding assay: A CiMl-ELBA assay may be peifoaned as described by 
Arakawa et aL (1997) to detennine the affinity of plant-derived CTB for CMl-gangLioside. The 
microtitcrplate is coated withmonorialog?xi^08ide-GM (SigmaG-7d41)byincubating the plate 



wo 01^959- PCT/USOl/06288 

166 



with 100»il^«DofGMl (SJO^iEfniSi'mhkaaba^ Altanativcly. 
the weUs can be coated wito 100 fil/wdl of The plates are incubated 

with tninsfonnedplant total sohiblepfotem^ 
ovemi^ at 4DC Tlie remainder of the piocedu^ 

5 

Induction of oral tolerance: Four week old female NOD mice may, fiar exanq)le, be purchased 
fom Jackson I^iatory (Bar Haibor,NfE) and housed at an an^ Themiceaie 
divided into three gnw^ each groupconsistittgof ten mice. Eachgioiq>isfedQneafdiefoDowing 
nicotine free edible tobacca untransformed. expressing CHB^orexpressingC^^ 
10 pnrtein. B^gfamingatS weeks of ags^eadiniOTseisMSgofnic^ 
week until reaching 9 wedcs of age (a total of five feedings). 

Antibody ttter: Alten weds of age,, the sermn and fecal material are assayed feanti-CTBa^ 
anti-proinsuto antibo^ isotypes osiii^ the ELISA met^ 

IS 

Assessment of diab^c symptoms in NOD mice: The mddeoce of diabetic synqrtwns can be 
compared among mice fed widi control nicotine free edible tobacco that e^qpresses CTB and those 
thateoipresstbeCTB^insulmfnsionprotein. Starting at 10 weeks ofage, the mice aremonitored 
onabiweekly basis with nrinaiy ghicose test stiips(ClimstixandDiastix, Bayer) for development of 
20 diabetes. Glycosnricmicearebledfromdietailvemtocheckfbrglycan^ 

(Acco-Gbeck, BoehringBr Mannheim^ Diabetes is confinnedby hypcrghfcemia C>250 mg^dl) for 
two consecutive weeks (M&etaL 1997). 
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a. SPECIFIC AIMS 

Research on human pzoteins in the past years has revohitiojmed the use of these 
tfaerapeatically vahiable proteins in a variety of clinical situations. Since liie demand for &ese 
proteins is expected to increase ccnsideiably in tiie coming years, it would be wise to ensure that 
5 in the fatxm they will be available in significantly larg^ amounts, preferably on a cost-effective 
basis. Because most ^nes can be expressed in many different sj^stems, it is essential to 
detennine which system offeis the most advanta^ for the manu&cture of the reccmbinant 
protein. The ideal esqicession system would be one tiiat produces a maximum amomit of safe, 
biologicaily active niiaterial at a minimnm cost The use of modified mammalian cells with 

10 recombinant DNA techniques has the advantage of resubtng in products, whidi arc closely 
related to those of natural origin; however, cultniing of these celk is intricate and can only be 
caxned out on limited scale. Tt» nse of microoiganisms such as bacteda peimits manu&cture on 
a larger scale, but introduces &e disadvantage of producing products, which differ appreciably 
fitsn the prodocts of natural origin. For exsmple, proteins that are usually glycosylated in 

15 hmnans are not glyco^ated by bacteria. Fardieririoi»^ 

levels in K cott frequently acquire an uimatuxal confoxmationy accompanied by intracellular 
precipitatiofi due to lack of proper folding and disulfide bridges. Pcoducti<m of reccmlnnant 
proteins in plants has many potential advantages for genesaring biophaxmacenticals relevant to 
clinical medicine. These include the foUowiiig: (I) plant systmis ate more economical than 

20 industrial fecilities using fermentation systems; (ii) technology is available for harvesting and 
processing plants/ plant products on a large scale; (iii) elimination of the purification requirement 
"wbsxi the plant tissue containing the recombinant protein is used as a food (edible vaccines); (iv) 
plants can be directed to targ^protdns into stable, intracellular compartments as chloroplasts, or 
expressed directly in chbroplasts; (v) the amcimt of recomtnixairt product that can be pnxtuced 

25 iqiipnMichcs industrial-scale levds; and (vO heaMi risks due to contamination wi& potentiai 
liTiman pathog^is^xins are mhdmized. 

It has been estimated that one tobacco plant should be able tn pmAica mtm Te twnhmant 
protein tiian a 300-litar fermenter of K coli (Crop Tech, VA). In addition, a tobacco plant 

30 produces a million seeds» j^cilitating large-scale production. Tobacco is also an ideal choice 
because of its reiative ease of gen^ic manipulation and an wap&a^ojag need to e?q)lore alternate 
uses fiff fliis hazardous crop. However, with the exception of enzymes (e.g. phytase), levels of 
foreign proteins produced in nuclear transgenic plants are generally low, mostly less than 1% of 
the total soluble protein (Knsnadi ct aL 1997). May et al. (1996) discuss diis (Hoblem using the 

35 following exanqples. Althoii^ plant derived leoombinant hqiatitis B sur&ce antigen was as 
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xmuneicial recombinant vaccine, the levels of esqinession in transgenic tobacco 
were low (0.0066% of total soluble protein). Even though Nonvalk virus cc^d jirotein 
expressed in potatoes caused cnal inmmnization when consumed as food (edible vaccine), 
expression levels wim low (03% of total soluble ptotein). In particulai^ expression of human 
5 proteins in nuclear tran^enic plants has been dis^pointingty low: e.g. human interferon-^ 
0.000017% of fiesh we^i, human serum albumin 0,02% and ^rytbropoietui 0.002^ of total 
soluble protein (see taldel m Kusnadi et aJ. 1997). A synthetic gane coding.for the human 
epidermal growfii factor was expressed only up to 0.001% of total soluble protein in transgBmc 
tobacco (Msf et aL 1996). The cost of prododng reccMubinant proteins m alfetfo. leaves yna 

10 estimated to be 12-fokl lower than in potato tubers and comparable with seeds (Kusnadi et aL 
1997). Hov^ever, tobacco leaves are modi larg^ and have much higher biomass ttffiu al&lfiu 
Planet Biotedmology has recently estimated that at 50 mg/hter of maimnalian cell culture or 
transgenic goat's milk or SOmg/kg of tobacco leaf e^qxression, the cost of punfied IgA will be 
$10,000» 1000 and 50^g, respectively (DanieU et aL 2000). The cost of production of 

15 lecombmant protems will be 50-fi>Id lower than fbat ofKcoli fennentation (with 20% expression 
levels m Rcoli (Kusnadi et al. 1997). A decrease in msulin eqxession fiom 20% to 5% of 
biomass doubled the cost of production in j51c0K(PetridisetaL 1995). Esqnression level less than 
1% of total sohdite protein in plants has been fimnd to be not oonimer^^ 
aL 1997)w TheFefore, it is important to increase levels of expression of recorribinant proteins in 

20 plants in order to eiq^loit plant prodocticm of pharmacologically inqx)rtant protema. 

An alternate approach is to express foreiga proteins in chloroplasts of hig^ plants. We 
have recently integrated fcxeign genes (iQ) to 10,000 copies per ceU) into tiiietc4>accochloroplast 
genome lesohh^ in accunnilalion of recombinant ^noleins up to 46% of the total soluble protein 

25 (Dc COsa et aL 2001). Chknoplast transfmnation utilizes two flanMr^ s^^pmnwys fl > ?it ^ tttrough 
homologoiis recombination, insert foreign DNA into the spacer mffoa betwe^ the functional 
genes of the chloroplast genome^ &us targeting the foreign genes to a precise location. This 
eliminalBS fbe **po^on efifecf and gene sQotcmg fiequently observed hi nuclear transgenic 
plants. Qdoroplast genetic engineering is an enviromnentalty fiiendly s^roach, minifnigMfig 

30 conc^ms of out^eross of introduced traits via poll»i to weeds or other craps (Bock and 
Hagemarm 2000^ Hdfetz 2000). Also^ the concerns of insects developing resistance to 
biopesticides are minimized by hyper-expressiom of single insecticidal proteins Qiig^ dosage) or 
expiession of difierent types of insecticides in a single transformation event (gene pyramiding). 
Ooncecns of insecticidal proteins on nonrtarget insects are minimized by lack of CTcpression in 

35 tn m s g coic pollen (Be Cosa et aL 2001X 
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Most inqjortantly, a sigoificant advantage in the productiQn of pharmaceutical pn^ins in 
chloioplasts is tfaeir abili^ to process eakaryotic proteins^ including folding and formation of 
disulfide bridges (Drescfaer et al. 1998). Cbaperonin proteins are present in cbloiopiasts (Roy, 
5 19S% Vierfing, 1991) ftat fimction in folding and assembly of prokaiyotic/^jkaiyotic proteiDS. 
Also, proteins are activaled by disulfide bond oxido/reduction cycles using ^ chloioplast 
tfaioredom system ^(eulland and NG^niac^Maslow, 1999) or chloroplast pnotein disulfide 
isomerase (Kim and Mayfield, 1997). Accamuladon of fully assembled, disulfide bonded form 
of human somatotropin via chlonyplast tiansG»mation (Staub et aL 2000), (^gommc finm of 

10 CTB (Hienriques and Damell, 2000) and the assembly of heavy/tight chains of immanized Gu/s 
13 antibo^ in transgBnic chloroplasts ^Panchal et aL 2000) provide strong evidence §ost 
successful pR>cessing of pharmaceutical luteins inside chloroplasts. Such folding and assembly 
should eliminate die need for highly e3q)ensive in vitro jnrocessing of pharmaceutical proteins. 
For example, 60% of the total operating cost in the pnxhictian of human insuHn is associated 

15 widi in vitro processing (fanuation of disulfi^ bridges and cleavage of mettuoninc^ Pcbddis et 
al.1995), 

Anodier major cost of insulin pxoductioii is purification; <dbro m atography accounts for 
30% of opttcating expenses and 70% of equipmoit in production of insulin (Petridis et al. 1995). 

20 Therefore, new ^proaches are necessary to minimize or eliminate chromatogFaphy m insuHn 
production. One such approach is the use of GVGVP as a fusion protein to facilitate single step 
purification without tiie use of chromatography. GVGVP is a Protein Based Polymer (PBP) 
made from synthetic g^es; at lower temperatures this polymer exists as more extended 
molecules. I^ixm raising the temperature above the transition range, polymer hydrpphobically 

25 folds into dynamic stroctures called ^-fip]rals that further aggregate by hydrophobic association 
to form twisted filameffits (Uny, 1991; Uny et aL» 1994). Inverse tBn^>eiatuie transition offers 
several advantages. It &cilrtates scale vp of pmjfication fixmi grams to kilograms. Milder 
purification condMon requires only a modest change in tempeiatuie and iordc stxengOL This 
dioald also &cilitate faig|»r recoveiy, fester purification and id^i volume prooesam^ protein 

30 pmification is g^siaally the slow step (botfleneck) m phaxmaoeutical {voduct development 
Ihroug^ e3q>loitati0n of this reversible inverse tempemture transition property, simple and 
mexpeusivB extraction and purification may be performed. The temperature at which the 
aggregation takes place can be manipuhted by engineering biopolymers containing varying 
numbers of repeats and changing sah concentration in solution (McPherson et al., 1996). 

35 Chloroplast mediated CTcpression of insulin-polymer fusion protein should eliminate the need for 
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feimentatiQii process as well as reagents needed for recombinant protein 
purification and downstream processing 

Oral ddiveiy of insulin is yet another powerful q>proach that would eliminate 97% of the 
5 pn^ductiQncostofinsDlin^etridiselaL 1995). For exaniple» Son et al (1994) have shown that 
feeding a anall dose of antigens eoigugBlBd to die receptor binding ncm-to3dc B subunit moiety 
of the cholera toxin (CTB) si^pressed systemic T cell-mediated inflammatory reactions in 
animals. Oral administration of a myelin antigen conjngated to CTB has been ^wn to protect 
animals against enccfilialan^litis* even wben given aBcr disease tndoction (Sun et aL 1996). 

10 BetgBTDt et al. (1997) r^rted tiiat feeding small amounts of homan insulin conjugated to CTB 
siq^>cessed beta cell destruction and cHnical diabetes in adult non-obese diabetic (NOD) mice. 
The protective effect could be trans&ired by T cells from CTB-insulin treated animals and was 
associated witii reduced insulitis. Ibese results demonstrate that protection against autoimmune 
diabetes can indeed be achieved by feeding small amounts of a pancreas islet cell auto antigen 

15 lidffid to CTB (Bergerot et al. 1997). Conjugation witih CTB facilitates antigen detivery and 
presentation to Use Out Associated Lymphoid Tissues (GALT) doe to its affinity fat the cell 
sur&ce receptor (iM i-gangjioside located on GALT ceDs, for increased uptake and tnmiunologic 
recogDiti(»i (Arakawa et aL 1998). Transgenic potato tnbeis esqpressedup to 0.1% CTB-insuMn 
fosion protem of total sohible protem, which retained GMrganglioside bmding afGnity and 

20 native antogenicily for both CTB and insulin. NOD mice fed with transgenic potato tubm 
containing microgram quantities of CTB-insulin &sion protein showed a substantial redaction in 
insulitis and a delay in the progression of diabetes (Aikawa et al. 1998). However, for 
commmdal e^loitation» the levels of expression should be increased in transgenic plants. 
ThQ:c£are» we propose here expression of CTB-insulin fusion in transg^c chloroplasts of 

25 nicotine fitc edible tobacco to increase levels of expre ssi on ad cqa ate fiar animal testing. 

Taken tog^or, k>w levels of expression of hmnan proteins in nuclear transgenic plants, 
and difikuhy hi folding assembly/^irocesshig of human proteins in E,coH shonki make 
chloroplasts an allentate compartment fiir eaqnession of these protdns; production of buman 

30 proteins in transgenic chloroplasts should also dramatically lower die production cost Lar^ 
scale production of insulin in tobacco in conjunction with an oral delivery system should be a 
powerful appioBch to provide treatment to diabetes patients at an affordable cost and provide , 
tobacco ^axists alternate uses for this hazardous c^op. Therefore, the first objective of this 
project is to use poly(GyGVP) as a fusion protein to enable hyper-expression of insulin and 

35 accQnq)lii^ rapid (me step purification of ttie fiision peptide utilizing the inverse tranperahire 
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^rties of this potymer. The second objective is to develop iiisnlin-CTB fiision 
jiiotein for oi»l delhreiy in nicotiiie fiee edible tob^ 

Both objectives will be accos^>]ishfid as follows: 
5 a) Develop lecombinmit DNA vectors for enhanced eiqgessian of Proinsalm as fpsioii pHrteins 
with GVGVP or CIB via chteroplast genomes of tobacco 

b) Obtain trassgedc tobacco (Petit Havana &LAMD 605) p]^ 

c) Characterize transgenic expxesston of prdnsolin poljmiier or CIB fiision proteins using 
molecular and biochemical methods in chloropIastB 

10 d) En^Ioy existing or modified methods of po^rmerponficati^ 

e) Analyze Mendelian or maieiiial inheritance of traasgeoio plants 

f) Large scale porification of insulin and comparison of cunrent insulin purification me&ods 
with polymer-based purification method in KcoH and tdTacco 

g) Conqpare natural refolding in chlonyplasts with firivt^ 

15 h) Characterizalioa^eM and purity) of pioinsulin produced in j&co/i and tr 

i) Assessment of diabetic symptoms in mice fed with edible tobacco e^qnessing CTB-insulin 
fusion protem. 

b. BACKGROUND AND SIGNIFICANCE 

20 Diabetes and Insulin: llieniost obvious action of insulin is to lower b^ 

1973)l This is a result of its immediate efiect in increasing glucose i^)take in tissues. In muscle, 
under the action of insulin, ghicose is more leadily taken up and eith^ converted to glycogen 
and lactic add or OTodized to carbon dioTode. Lisulin also a&cts a nundw 
concerned with cdhdar metaboHsoL It increases flie activity of gihicokinas6» which 

25 pho^dioxylatcs iJucose thmby inamsing the late of glucose metabolism in die livec Insulin 
also siqiipresses ghjconeogeoesis by depressmgtiie function of Hver enzymes, vriuch operate the 
levearse pathway fiom proteins to glucose. Lack of insulin can restrict the tiansport of ghicose 
into muscle and adipose tissue. This results in mcreases in blood glucose levels Qiyp&c^ycesma). 
M addition, the breakdown of natural &t to fiee &lty adds and g^yce^ 

3D a rise in the &tty acid content in the blood. Ihc^reased catabolism of fetty adds by die liver results 
in greater production of ketone bodies. Hi^ diSuse £rom the Ever and pass to the muscles for 
further oxidation. Soon, ketone body production rate exceeds oxidation rate and ketosis results. 
Less amino adds are takssn up by the tissues and protein degradation results. At the same time 
gtuconeogenesis is stimulated and protein is used to produce glucose. Obviously, lack of insulin 

35 has serious ccmseqnences. 
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Diabetes is classified into types I and IL Type I is also known as insulin dependent 
diabetes mellitos (IDDM). Usually this is caused by a cell-mediated oatoimmune destruction of 
ttie pancreatic p~cells (Davidson, 1998). Those soffenng &am this type aie dependent on 
5 external sources of insulin. Type n is known as noninsulin-dependent diabetes xnellitus 
(NIDDM). This usually involves resistance to insulin in ccwnbination with its undeiproduction. 
These pzomineat diseases have led to ext^ve research into microbial production of 
recombinant human insulin (ifll). 

10 Eaqprcssion of Recombinant Hmnan bsidiB in Kcofb In 1978» two ftousand kilograms of 
insulin were used in the world each year, half of diis was used in the United States (Steiner et aL» 
197S). At tiiat tune, the numb^ of diabetics in the US were increasing 6% eveiy year (Gunby, 
1978). hi 1997-98> 10% increase in sales of diabetes care products and 19% increase in insulin 
products have been r^xnted by Novo Nordisk (world's leading supplier of insulin), making it a 

15 7,8 billion dollar industry. Annually, 160,000 Americans axe killed by diabetes^ making it die 
fourth leading cause of death. Maiqrinetfaods of producticm of lifl have been developed. Lisulin 
genes were first ch emicall y synftesized for e^qnession in Esherichia coU (Qcea et aL, 1978). 
Hiese genes ei^oded sq>aiate insulin A and B chairs The genes were each e:q»ressed in R coH 
as fiision proteins widi the ^-galactoddase (Croeddel et aL, 1979). The first documented 

20 production of ifll using ttns system was reported 1^ David Gocddel fixxm (Scnentech (Hall, 
1988). The genes were fiised to the Tip synthase gene, which resulted is increased insulin yield, 
due to the smaller fusion peptide. This fusion protein was approved for commercial production 
by Eli miy in 1982 (Chance and Frank, 1993) wi& a product name of Humuhn. As of 1986, 
HumuHn was produced fiom proinsuHn genes. Proinsulin contains both insulin chains and the C- 

25 pq»tide diat connects them. Normal in vitro post-translational processing of proinsulin mctudes 
use of trypsin and caiboxypqptidase B for maturation to insuMn. Odier data concemmg 
commadal j^xKfaiction of Humulin and other insnhn products is now considered proprietary 
inforrnation and is not available to die pubUc. 

30 Protdm Based Pdlymers (PBF): The synttietic gmie that codes fx a bioelastic FBP was 
designed after repeated ammo acid seqi^nces GVGVP, observed in all sequenced mammalian 
elastin proteins (Yeh et al. 1987). J^astin is one of die strongest known natural fibers and is 
present in sldn, ligaments, and artenal walls. Bioelastic PBPs containing multiple repeats of this 
pentamer have remarkable elastic p roperties, Miflh ltng several medical • and non-medical 

35 api^calions CLIny et al. 1993, Ucry 1995, Daniell 1995). GVGVP polymers prevent adhesions 
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;ery» aid in leconstructing tissues and delivering diugs to the body over an 
extended period of time. NorA American Science Associates, Inc. reported &at GVGVP 
polymer is non-toxic in mice, non^iisitizing and non-antigraiic in guinea pig^ and nan- 
pyrog^c in rabbits (Uny et aL 1993). Researchers have also observed that inserting sheets of 
5 GVOVP at die sites of contaminaled wounds in xats reduces the number of adhesions that form 
as the woimds heal (Uiry et aL 1 993). In a similar imumer, using the OVGVP to encase muscles 
that are cot during eye surgery in rabbits prevents scarring following &e operadon (CJxxy et aL 
1993, Uny 199^. Othor medical applications of bioelastic PBPs inchide tissue reconstiaction 
(synth^c ligaments and arterieSi bonesX wound oaveriiig3» artificial pedcaidia, catfaetera and 
10 programmed drogdelivecyCUny, 1995;Uny etaL» 1993» 199$). 

We have eoqxressed die elastic PBP (GVGVP)i2i in K coli (Guda et al. aLl Brixey ^ 
aL 1997)> in the fungus AspergiUus nidukms (Eleizog et aL 1997), in cultured tobacco cells 
(2iang et aL 1^^, and in transgenic tobacco plants (Zhang et aL 1996). In parttcular, 
15 (OV<jVP)i2) has been expre sse d to such high levels in K coU that polymer imdusion bodies 
occupied iqp to about 90% of the cell volume; also^ indnsion bodies have been observed in 
chlomplasts of transgenic tobacco plants (see attached article, Daniell and Guda, 1997). 
Recently, we reported stable transformation of die tobacco chioro|dasts by integraticn and 
esqptession titie biopoj^er grade (EG121X into the Large Single Copy region (5,000 copies per 
cell) or the Inverted Rqieat regkm (10, 000 c<^ies per cell) of tiie diloroplast genmne (Guda et 
aL,2000). 

PBP as Fusion Protdm: Several systems are now available to simplify piotdn putiScation 
including tiie maltose binding protein (Marina et al. 1988)» ghxtotfaione S^ransferase (Smith and 
Johnson, 1988), biotinylated (Tsao et aL 1996), tfaiozedoxm (Smith et aL 1998) and celhilosc 
binding (Qng et aL 1989} proteins, in order to e&ctivefy utilize afbreokentioiied fusion protmns 
in the puiification process, recombinant DNA vectors fcHr fusion widi short pqrtides are now 
available (Smidi et al. 1988; Kim and Rames, 1993; Sn ^ aL 1992). Recombmant proteins are 
gBoerally purified by afiMty (^xomatogiqpby^ 
30 et aL 1997). While these are useful tedmiques for laboratory scale purification, affinity 
chromatogr^hy for large^^cale purification is time consuming and cost prohibitive. Thmfore, 
economical and non-chromatographic techniques are hig}ily desirable. In addition, a common 
solution to N-tenninal degradation of small peptides is to fuse foreign pq[>tides to endogenous E, 
coU proteins. Early in the develqpmmt of this techni^^e, P-galactosidasc 0-gaO was used as a 
35 fiision protein (Goldberg and Gof^ 1986). A drawback of this method was that die P-gal protein 
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high moleonlar weight (MW 100,000). Therefore, the propoition of the peptide 
product in the total protein is low. Another problem associated with the large P-gal fusion is 
early termination of translation (Bumette, 1983; Hall, 1988). This occurred when p-gal was i^ed 
to produce human insulin peptides because the Aisicm was detached firom the ribosomc during 
5 translation thus yielding ioconQilete peptides. Jn aider to increase the peptid& production, other 
proteins of lower molecular wei^ proteins have beoi used as fiisicm proteins. For example, 
bett^ yields were obtained with the tryptophan synthase (190aa) fusion proteins (Qall, 1988» 
Bnmetf^ 1983). 

10 One of the primary gpals of tins study is to use poIy(GVGVP) as a fbsiim protetn to 

enable hyper-eoqxiessian of insulin and accoinplish r^d ono step purification of the fusion 
pqptide. At lower temp^ntures the polymers exist as more extended iDolecttles wM 
&e tempmture above &e transition range hydrc^obically fold into dynamic structures called 
spirals that furtiber aggregate by hydrophobic association to form twisted filaments (Uiry, 1991). 

15 Ihxough e7q>loitation of this reversible property, single and inc3q)a]sive extraction and 
purification is perfonned. The teanperature at which aggregation takes place (TO can be 
manipulated by engineeriiig biopolymers containing varying numbera of rqpeats or changing sah 
ccmceotraticMi (McPherson et aL, 1996). Anottier group has recently dcnumstiated purification of 
recofmbmant protefais by fiisioai with thermally responsive polypeptides (Meyer and CMkoti, 

20 1999). Polymers of dLffercnt sizes have been synthesized and e^qnressed in Kcoli in Ae PFs 
laboratory. This aj^roach would also eliminate the need for expensive reagents, equipment and 
time required for purificatiott. 

Cholera Toxin ^ subnnit as a fiisifHi protein: Vibrio ckolerae causes diarrhea by cc^onizingtiic 
25 small intestiiie and producing enterotoxins, of which the cholora toxin (CT) is considered the 
main cause of toxicity. CT is a hexamedc AB5 protein having one 27KDa A subunit wMdi has 
toxic ADP-ribosyl transferase activity and a non-toxic peiitamer of 11.6 kDaB sobonits that are 
noD-covakntly finked intD a very staUe dong^imit Hke sirueuire into winch the toxic active (A) 
subunit is mserted The A subunit of CT consists of two fi»^^ and Alwhidi are linked 
30 by a disulfide bond. The enzymatic ;ictivity of CT is located solely on tiie At fragment (Gill, 
1976)l The A2 fragm^ of fte A subunit links the Al fragment and the B pCTtamer; CT binds 
via specific interactions of tiie B subunit pentamer with OMl ganghoside, the memfarane 
receptor, psesesai on tiie intestinal q[>i1helial cell sur&ce of the host The A saburdt is thra 
translocated into the cell where it ADP^ibosylates tlie Gs subunit of adenylate cyclase bringing 
35 about tiie increased levds of cyclic AMP in affected cells tiiat is associated witii the electro^ 
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of clinical cholera (Lebens et aL 1994). For optima] enzymatic activity, the Al 
firagmeiit needs to be sq>arated fiom the A2 fogment by pxt^ cleavage of the main chain 
and by leduction of die disulfide bond linking them (Mekalanos et al 1979). 

5 Hie E}qxression and assembly of CTB in transgenic potato tubers has been r^rted 

(Arakawa et aL1997). The CTB gene inchuting the leader peptkie was iused to an endoplasmic 
r^cufaim letentton signal (SEKDEL) at tte 3' end to sequester the CTB {Hroteus within the 
hunen of the BEL The DNA firagment encoding tiie 21-amkio add leader peptide of the CTB 
protein was retained in order to direct the newly syn&esized CTB protein into the Inmen of the 
10 ER. homEKunoblot anatysis indicated that the plant doived CTB piotein was aotigenically 
t indistinguishable fitom tiie bacterial CTB protein and fbst ol^omeric CTB molecules (Wb SO 
kDa) were the dominant molecular species isolated fiom transgenic potato leaf and tuber tissues. 
Similar to bacterial CTB, plant derived CTB dissociated into mcMKuners Qhir-lS kDa) during 
heat/add trBatment 

15 

Bnzyme linked hmnunosoibait assay methods indicated that plant synthesiygd CTB 
protein bound specifically to GMl gai^osides^ flte natural membrane recqtors of Chotea 
Toxin. The maximimi amount of CIB piotein detected in auxin induced transgenic potato leaf 
and tuber tissues was s^^noxnnately 03% of ^ total soluble protein. The oral nnmunization of 

20 CD-I mice with transgenic potato tissues transformed with die CTB gene (administeied at 
weekly intervals for amondi with a final booster feeding on day 65) has also been reported. The 
levels of serum and muco^ anti-chol^ toxin antibodies in mice w^ found to generate 
protective imnmnity against the cytopathic effects of CT holotoxiiL FoUowing intraileal injection 
witii CT, tiie plant immunized mice showed iq» to a 60% reduction in diarrheal fluid 

25 accunmlation in the smaU intestine. Systemic and mucosal CTB- specific antibody titers were 
determined in bodi senna and feces collected fixmi immunized mice by tiie class-specific 
chemihnninescent ELESA method and the en^int titers for die dnee antibodly isotypes ( 
IgMyigQ and IgA) were deteonined. The extent of CT neutralization in bodi Vero cdl and ileal 
loop experiments suggested diat antt^TIB antibodies pKfvcot CT binding to cellular OMI- 

30 garigliosides. Also, mice &d with 3 g of transgmc potato exhibited similar intestinal protection 
as mice gavaged with 30Qg of bacterial CTB. Recombmant LTB [iLTB] (the heat labile 
enterotoxin produced by EnteiotoxigiEnic Rcoli) which is structurally, fonctionally and 
immunologically similai to CTB was expressed in transgenic tobacco (Amtzen et aL 1998; Haq 
et aL 1995). They Imve reported that» the riLTB retained its antigenid^ as shown by 

35 immunoprec^itation of rLTB with antibodies raised to rLTB fi^om E. coti. The iLTB prot^ was 
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lolecular weig^ and aggregated to fonn Uie pentamer as confiimed by gal 
permeation duomatograi^. 

Belivay of Hnnuin Insiiliii: Lisulin bas been ddivered inlraveiKnisfy in tiie past several years. 
5 However^ more recently, alternate methods sncili as nasal spray, axe also available. Oral delivery 
of insulin is yet anotiier new approadi (Malbiowilz et aL, 1997). Bngpieered polymer 
micaroqiheaKs made of biok^caUy erodable polymers* which display strong interactions with 
gastrointestiiial nmcns and cellular linings, can traverse berth mucosal abscnptive qntheltiim and 
the foniclo-assodated epi&eHum, covering the lymphoid tissue of Peyi^s patches. Polymers 
10 maintain contact widi inte^inal epitfaelhnn for extended periods of time and actually penetrate 
through and between cells. Animals fed with die poly(FA: PLGA)-encapsu]ated insulin 
preparation were able to r^;ulate the glucose load better than controls, ccmfiiming that insulin 
crossed the intestinal barrier and was released from the micro^heres in a biological^ active 
fonn C^fothiowitz et aL» 1997). 

15 

BestdeSy CTB has also been demonstiated to be an effective carrier molecule for the 
indoctkan of mucosal irrmmnity to polypeptides to jfMch it is chemically or gqieticaOy 
conjugated QAcKboob^ et aL 19S4; Dertzbaugh et al. 1993) The prodoction of 

umnunomodolatory transrrmcosal carrier molecules, such as CTB, in plants may greatly improve 
20 the efficacy of edible plant vaccines (Haq et ai. 1995; Thanavala et al. 1995; Mason et aL 1996) 
and may also provide novel oral tolerance agents for prevention of such autoimmune diseases as 
Type I diabetes {Zhang et al. 1^1), Rheumatoid arthritis (Treaitham et aL 1993)4nultq)le 
sclea:osis( Kk>uiy ^ aL 1990; MiUer et aL 199% Werner et aL 1993) as well as theprevraition of 
allergic and allograft rejection reactions (Sayc^ et aL 1992; Hancock et aL 1993). Therofine; 
25 egqxessiogaCTB-inoinsiilinfi^ 

Chloroplast G«ietic Engineering: When we developed the concept of diloroplast genetic 
ai^needng (Daniell and McFadden» 1988 U.S. Patents; Damdl, World Patent, 1999^ it was 
possible to introduce isolated intact cfaloroplasts into protoplasts and r^enerate transgenic plants 

30 (Carlson, 1973). Therefcxre, eariy invesdgadons on chloroplast transformation focused on the 
devdopment of m organeUo systems using intact diloroplasts enable of efficient and prolonged 
transcription and transMon (Daniell and Ri^iz, 1982; Daniell et al., 1983, 1986) and 
e^qaression of foreign g^nes in isolated ddoioplasts (Daniell and McFadden, 1987). However, 
after die discovery of fte gene gun as a transfomation device (Daniell, 1 993), it was possible to 

35 transform plant diloroplasts without die nse of isolated plastids and protoplasts. Qiloroplast 
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was accomplished in seveial phases. Transi^t expression of fisreign genes 
in plastids of dicots (I>aniell et aL, 1990; Ye et aL, 1990) was followed by such studies in 
monocots (Daniell et aL» 1991). Unique to die chloroplast genetic engineexing is die development 
of a foreign gene expression system using autonomously replicating chloroplast expression 
5 vectors (Daniell et al., 1990). Stable integration of a selectable marker g^e into the tobacco 
chloroplast genonze (Svab and Maliga, 1993) was also accomplished using the gene gort 
However, nsefol genes confening valuable traits via chloroplast genetic engineering have been 
demonstrated only recently. For exanqpte^ plants resistant to B X sensitive insects were obtained 
byirit^iratingtheco^Z^^gem into the tobacco cUoropl^ 199S)i Plants 

10 resistant to B.t resistant insects (isp to 40,000 fold) vrm obtained by hyper-eoqiressiQn of the 
ayUA gene widiin the tobacco chloroplast genome (Kota et aL, 1999). Plants have also been 
gen^ically engmeered via the chloroplast genome to confer herbicide resistance and the 
introduced foreign genes were mntrmnlly inhciited» overcoming the problCTS of ont-cross widt 
weeds (DanieU et al, 1998). Qilorc^kst genetic engineering has also been used to produce 

15 pharmaceutical products diat are not used fay plants (Staub ct aL 2000, Ouda et aL 2000). 
Chloroplast graietic engineering tedmology is currently being ^plied to other useful crops 
(Sidmov et aL 1999; Daniell, 1999). 

e. PKEUMINARY STUDIES 

20 A remark^le feature of chlciqplast genetic engineering is the observation of 

exceptionally large accumulation of foreign proteins in transgenic plants, as much as 46% of 
CRY protein in total sohible protem, even in bleadied old leaves (DeCosa et aL 2001). Stable 
expressioin of a p^haimaceutical protein in chloroplasts was first reported for GVGVP» a protein 
based polymer widi varied medical plications (such as the pr^fV&aAon of poi^-surgical 

25 adhesions and scars, wound covedngg^ artificial pericardia, tissue reoonstroction and' 
progtammed drug delivery (Gioda et aL 2000). Subsequoitly, e^qxressian of Hxs human 
somatotrqpm via die tobacco dilcxroplast genome (Stanb et aL 2000) to hi£^ levels (7% of total 
soluble protein) was observed. The fi>llowing mvestrgadons that axe in progress in die Daniell 
lab iDustiatc Hxc powor of dus tochnok)gy to express small peptides, entire operons, vacdnes that 

30 require oligomcric protons with stabb disulfide bridges and monodonals that require assonbly 
of heavy/H^ chains via chaperonins. In order for edible iosulin approach to be successful, it is 
essential to develop a selection system fiee of antibiotic resistant genes. One such marker free 
chloroplast transformation system has been accomplished in this laboratory (Daniell et al. 2000). 
Experiments are in progress to develc^ chloroplast transformation of edible leaves (al&l& and 

35 lettuce) for die practical applications of this appioadL 
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Engiiieermg novel pathways via the chloroplast genome: In plant and animal cells, nuclear 
mRNAs are translated monocistronicdlly. This poses a serious piobl^ when engineering 
multiple graies in plants (Bogjor^d, 2000). Therefore, in order to e^qxress the polyhydroxybutyrate 
5 polymer or Guy's 13 antibody, smgle genes were first introduced into individual transgenic 
plants, dien these plants were back-«iossed to reconstitute the entire pathway or the coii^>lete 
protein ^avrath et aL 1994; Ma et al. 1995). Similady, m a sevwi year long efifort. Ye et al. 
(2000) recently introduced a of tiiree genes for a short biosynlhetic pathway that resulted in 
P-carotene eaqiression in ric& In contrast, most chkm^last genes of hig^ plants are 
10 cotianscribedCBogoi8d>2000).Baq»rcs8iQnofpdyGistromvi^ 
uiuque opportimity to eoqpiess entire pathways in a siiig^ 

used the Badlbts 0mringfemis (Bt) aylP^ apesxm as a model Systran to demonstrate operon 
e^qjression and crystal formation via the chloroplast genome (De Cosa et aL 2001). 03^2Aa2 is 
the distal gene of a three-gene operoiL The orf immediately iqpstceam of cryQAal codes £3r a 
15 putative ch^>eionin diat ^cilitates the folding of aylAal (and other proteins) to form 
proteolytically stable cuboidal crystals (Ge et al 1998)» 

Thexefc^e, the cr^lAa2 bactenal operon was expressed in tobacco chloroplasts to test the 
resuhant transgenic plants for increased eacpxesskm and improved persistence of the accumulated 

20 msecticidalprotein(s). Stable foreign gene integration was confirxned by PGR and Southern blot 
analysis in Tq and Ti transgrauc plants. Cry2Aa2 operon derived protein accmnulated at 453% of 
the total soluble protein in mature leaves and remained stable even in old bleached leaves 
(46.1%)0Pigure 1). This is the Mgjiest level of foreign gene e^^ression ever reported in 
transgenic plants. Exccedm^ uncontrollable insects (10-day old cotton boDwarm, beetarmy 

25 worm) were killed 100% afier consuming tran^enic leaves. Electron microgc^hs showed the 
IHresence of the insectiddal protean folded into cuboidal crystals similar in shape to Cry2Aa2 
crystals observed in BaeSbis thwingfiensis (Rgnre 2). In contrast to currently marlreted 
transgenic plants with soluble CRY proteins^ folded protoxin oystals will be processed only by 
tar^t insects that have alkaline gpt p£^ tins sqpproach should hrqaove safety of Bt trax^genic 

30 plants. Absence of insectieidal protenis in transgenic pollen AHmi'Tittt*»>Q toxicity to non-taiget 
insects via pollen. In addition to diese enviionmentally fiiendly ^yproaches, this obsovation 
should serve as a model system for large-scale production of foreign proteins within chloroplasts 
in a folded configuration enhancing their stability and iacilitating single step purification. This is 
the first demonstration of e7q>reasiQn of a bacterial operon in transgpnic plants and opens the 

35 door to engineer novel pathways in plants in a single transformation event 
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Expressing small pqptides via the chloroplast genome: It is common knowledge that tiie 
medical commimity has been fighting a vigorous battle against drug resistant pathogenic bacteria 
for years. Cationic antibactexial peptides fiom mammals, amphibians and insects have gained 
5 more attention over the last decade (Hancock and Lehiea-^ 1998). Key features of these cationic 
pe{^des are a net positive chaige^ an afBnity for negatively-charged prokaryotic memtame 
phofigdiolqdds over neutral-charged eukaryotic memlnranes and the ability to form aggregates that 
disnq^ the bacterial memfaiane (Biggin and SansQQi, 1999). 

10 There are toee major pqjtides with a-helical structures, cecrc^nn firmn Ifyalopkora 

cecropia ^tant silk moth), magtmins ftam Xenopus laevis (Afidcan frog) and defensins fiom 
mammalian neutrophils. Magainin and its analogues have been studied as a broad-spectrum 
topical agent, a systemic antibiotic^ a wonnd-healing stimulant; and m anticancer agent (Jacob 
and Zaslof^ 1994). We have recently observed that a synthetic lytic peptide (MSI-99, 22 amino 

15 adds) can be successfully es^ressed in tobacco chloroplast (DeGray et aL 2000). The p^de 
retained its lytic activity against the phytopathogpiic bacteria Pseudomonas syringpe and 
nmltidmg zeststaot human palhog^ Pseudomonas aerug&iosa. The and-microfaial pqitide 
(AMP) used in this study was an anq^padxic alpha-helix molecule that has an affinity for 
n^atively charged phoqiholipids commonly found in the outer-membrane of bacterra. l^n 

20 contact widi these membranes, individual peptides aggn^ate to £mn pores In the membrane^ 
lesolting in bacterial lysis. Because of the concentration d^end^t action of the AMP, it was 
expressed via chl oroplast g^ome to accon^lish high dose delivery at the porot of infectkm. 
PGR products and Southern blots confirmed chloroplast int^ration of the foreign genes and 
homoplasmy. Giowdi and dcvclopinent of the transg^c plants was unaffected by hyper* 

25 e^qxression of the AMP witiiin cfalorqplasts. Jh vitro assays with To and Ti plants oonjBnned that 
^ the AMP was eoqiressed at high levels (21^ to 43% of the tcytal soluble protdn) and retained 
biological activity against FseudkMMOBovjr^^ a m^or plant pathogen, ih site assays resulted 
in mtrase areas of necrosis around the point of infecticm hi control leaves, while transformed 
leaves showed no signs (tf necrosis (200-800 pg of AMP at the site of iiifecticHi)(Figare 3> T| in 

30 vitro assays against Pjatufoymifosr aerugawaa (armilti-dnig resistant human pathogen) di^layed 
a 5^6% inhibttio^t of growth (Figure 4). These results give a new option in the battle against 
pbytopatho^oiic and drug-resistant human pathogenic bacteria. Small peptides (^kxi insulin) are 
degraded in most organisms. However, stability of this AMP in chloroplasts opens up tins 
conqiartment for esqpression of hotmones and other small jwptides. 

35 
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d assembly of monodonak in transgenic chloroplasfs: Dental caries (cavities) 
is probably the most prevalent disease of hnmankind Colonization of teeth by & mutans is the 
single most important nsk factor in the development of dental caries. S, mutans is a non-motile, 
gram positive coccus. It colanizes tooth sur&ces and synthesizes gtocans (insoluble 
5 polysaccharide) and fructans from sucrose using die enzymes gitucosyltransferase and 
fruclDS^tiansferase respectively (Hotz et al. 1972). The ghicans play an inqmrtant role by 
allowing tiie bacterium to adhere to the smooth tooth sor&ces. Ailer its adherence, the bacterium 
fennents sucrose and produces lactic add Lactic add dissolves the minerals of the tootfa» 
IMmhidng a cavity . 

10 

A topical monoclonal antibody ther^y to prevmit adherence of jSI mutans to teeth has 
recently been developed. The incidence of cariogenic bacteria ^ humans and animals) and 
dental caries (in animals) was dramatically reduced for periods of i^ to two years after die 
cessation of the antiboc^ therapy. No advose events were detected dther in the exposed animals 

15 or in human vohmteers (hda et aL 1998). The annual requirement for this antibody in the US 
alone may eventually exceed 1 metric ton. Therefore, this andbody was eaqzressed via the 
chloroplast genome to achieve higiher levels of 63qpi:essian and proper folding (Pandml ^ aL 
2000X integration of antibody gjenes into the chloocoplast genome was confirmed by PGR and 
Souttiem blot anafysls. The eapression of both heavy and light chains was confirmed by western 

20 blot analysis under reducmg conditions (Hgnre lAfi). The expression of folly assenibled 
antibody was confirmed by westran blot analysis under non-redudng conditions (Figure 7C). 
This is the first rqxnt of successfol assembly of a multi-subunit human protein in transgenic 
chloroplasts. Ptoduction of monoclonal antibodks at agricuhuial level should reduce thdr cost 
and mate new applications of monoclonal antibodies. 

25 

Marker free cUoroplast transgenk jdants Most transformation tar.iin^'qpfffl oo-intzoduce a gene 
that confess antibiotic resistance, along widi die gene of hiterest to inq>art a desned trait 
Regraierating transformed cells in antibiotic containing growth media permits selecdon of only 
diosc cells that have incorporated foe foreign genes. Once transgenic plants are regenerated, 

30 antibiotic resistance genes serve no nsefol purpose but they contrnoe to .produce thdr gjene 
products. One ^mnng die primary concems of genetically modified (ONf) crops is the presence 
of clinically impc»rtant antibiotic resistance gpne products in transgenic plants that could 
inactivate oral doses of the antibiotic (reviewed by Puchta 2000; Daniell 1S^9A). Alternatively, 
the antibiotic resistant g^ies ccndd be transferred to pathogenic microbes in die gashointestii^ 

35 tract or soil rendering them resistant to treatment with such antibiotics. Antibiotic resistant 
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e of the major challenges of modem medicine. In Gemmny, CSA crops oontaiiiing 
antibiotic icsistant genes have been banned fiom release (Peexenboom 2000) . 

QdcHfopIast genetic engineeriDg offers several advantages over nuclear transformation 
5 . including high levels of gene expression and gw containment but utilizes thousands of copies 
of tiie most comoEKmly used antibiotic zeststance genes. Engineering genetically modified (GM) 
crops without the use of antibiotic resistance genes shouM eUminate potential lisk of their 
transfer to ttie en vir onment or giit microbesL Therefore^ betaine alddxyde ddiydiog^nase 
(BADIQ gate fiom spmach is used in this study as a selectable maiker (Daniell et aL 2000). The 

1 0 selection process involves conveman of tajoc betaine aldel^de (BA) by the chhxraplast BADU 
enzyme to nontoxic g^dne betaine, wbich also serves as an osmqprotectant Chloroplast 
transformation efficiency was 25 fold M^ier in BA selection than spectinomycin, in addition to 
rapid regen^tion (Table 1). Transgenic shoots qypeared within 12 days in 80% of leaf discs 
U> 23 shoots per disc) in BA selection compared to 45 days in 15% of discs (1 or 2 shoots pec 

15 disc) on ^>6ctinomycin selection (Figure 8). Southexn blots confirm stable integration of foreign 
g»ies into all of tiie chloroplast genomes (-10^000 copies per cell) resulting in homoplasmy. 
Tian^emc tobacco plants showed 1527-1816% higher BADH activiiy at different 
developmental stages than untransfonned controls. Transgenic plants were morphologically 
Tn^igMp fl™^**^^ *^ fiom ii n irangfiirmftrt plants and the introduced trait was stably inherited in the 

20 subsequent generaticHL This is die first report of genetic engineering of tiie chlonq^last genome 
without the use of antibiotic selection. Use of genes that axe naturally present in qnnach fcnr 
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idition to gene containment, should ease public concenis or perception of GM. 
crops. Also, Ibis should be veiy helpful in the development of edihie insulin. 

Expression of cholera toxin p subunlt oligomers as a vaccine in chloroplasts: CTB wh^ 
5 administered orally (Lebens and Holmgren, 1994) is a potent mucosal imnninogen, which can 
nojtralize die toxidly of the CT holotoxin by preventing it Scam binding to the intestinal cells 
(Mor et aL 1998). This is believed to be a resolt of it binding to eidcaiyotic cell surfeces via the 
G|^l g9n£^osides>iecq[>tQis present on the intestinaiqnfiielMi^^ 

inrnnme response to pattogens (Ltpscombe et aL 1991) and enhancing ttie tmmime reqxMise 
10 ixdienGliemica]tycoiq)led to odier antigens (Dertzbang^ 1993; Hohngien et aL 1993; 

NasharetaL 1993; Sun etaL 1994). 

Cholem toxin (CTB) has previously been esqnessed in nudear transgenic plants at levels 
of 0.01 (leaves) to 03% (tubers) of the total soluble protein. To increase expression levels, we 

1 5 engineered &e chloroplast genome to e3q)iess the unmodified CTB gene (Henriques and Daniell, 
2(M)0). We observed expression of oligomsric CTB at levels of 4-5% of total sohible plant 
protein (Figure 5A). PGR and Southem Blot analyses confirmed stable integration of the CTB 
gene into the chloipplast genome. Western blot analysis showed that transgemc chloroplast 
expressed CTB was antigenicatty identical to commercially available purified CTB antigim 

20 (Figure 6). Also, (J^i-gangjioside binding assays confirm that chloroplast synthesized CTB 
binds to Ihe intestinal memibcane receptor of cholera toxin (Figure SB). Transgenic tobacco 
idants were moiphologicalty indistinguidiable 6om untiansfonned plants and die introduced 
gene was fbond to be stably inherited m die subsequent geneialian as confinned by Pdt and 
Soutiiem Blot analyses. The increased prodiK:tion of an efCdent tiansnmcosal carrier molecule 

25 and delivoy system^ Uke CTB, in chlorc^lasts of plants makes plant based oral vacdnes and 
fusion proteins widi CTB needing oral administraticHi, a nmch. more feasible approach. Hiese 
observations establish unequivocally that chloroplasts are capable of forming disulfide bridges to 
assemble foreign proteins^ and ideal for expression of CTB fusion proteins. 

30 Polymer-prolnsDlln ReeomUnanf DNA Vectors: One possible insulin expression system 
involves independent expression of insuhn diains A and B, as it has been produced in E. coU for 
commercial purposes in the past The disadvantage of diis method is that KcoH does not form 
disulfide bridges in die cell unless the protein is targeted to the periplasm. Expensive in vitro 
assembly after purification b iiecessary for dds iqiproeck Therc^ 

35 to esqxess the human proinsolin as a polvmM- fiision protein. This method is ideal because 
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e equable of fimnmg disaHSde faiidges. Using a sm^e gene, as opposed to the 
indmdoal cbains, would eliininate necesaty of condactmg two parallel vector constroctioa 
processes, as is required for the individual chains. In addition, the need for individual 
fermentations and purification procedures is eliminated by the single gene method. In addition, 
5 proinsulin requires less processing following extraction. 

Recentiy, the human pre-proinsuhn g^e was obtained from Genentedi, Inc. First the pre- 
proinsnlhi was sub-cloned mto pUC19 to fiicOitate fiutiier mampulations. The next stq> was to 
desigii primcys to make chloioplast expression vectors. Since we are interasted in proinsofin 

10 oqaession, the 5' primer was designed to land on the pioinsoliii sequence. Ihis FW pdmer 
exduded the 69 bases or 23 coded amino adds of the leader or pre^^equence of pr^soinsulin. 
Also, tiie forward pximer included die ^izymatic cleavage site for the protease &ctor Xa to avoid 
the use of cyanogen tnomide. Besides the Xa-&ctor, a Smal site was introduced to facilitate 
subsequent subcloning. The orda of the FW primer sequence is Smal • Xa-fector - Proinsulin 

1 5 gene. The reverse primer included BamHI and Xbal sites, phis a short sequence with homology 
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9 sequence following ^ prouumlin gene. The 2971^ PGR product (Xa Fns) was 
cloned inlo pCR2.1. A GVGVP 50-nter was g^erated as described pieviously (Darnell et aL 
1997) along with the KBS sequence GAAGGAG. Anoth^ Smal partial digestion was pecfonned 
to eluninate tiie stqp oodon of the biopolymer gene, decrease the 5(hner to a 40mer, and fuse die 
5 40mer to the Xa-pioiiisulin sequence. Once the correct fiagntent was obtained by the partial 
digiestioa of Smal (eliminating the stop codcMi but mctuding Ifae KBS site), it was Ugated to Ae 
Xa-piQinsulin fusion g^e resulting in die cosistmct pCR2.1-4Q-XaPris. Finally, die biopolymer 
(4Qnier) -proinfiulin fusion gene was sobdcmed into die chlorqplast vector pID-CtV or pSBL- 
CtV and the orientati(»i was checked in &e final vector using suitable restriction sites. 

10 

£a^ression and Poiification of tiie Biopotymer-proinsulin fusion protein: XL-1 Blue strain 
of R coU containing pLD-OOXaPtis and the negative controls, which included a plasmid 
containing the gi^ in die reverse ariajtation and die E, co&' strain without any plasmid were 
grown in TB brotlL Cell pellets wrae resuspmded in 500^ of autoclaved dH20 or 6M Guanidine 

15 hydrodiloride phosphate bu£^, pH 7.0 were sonicated and centrifuged at 4°C at lO^OOOg for 
IQmin. After centEifttgati<Mi, the supematants were mixed widi an equal volume of 2XINbu£fer 
(100 mM Tris-HCL pH 8, 100 mM NaCl). Tubes were warmed at 42^: fat 25mm to induce 
bi<^lymer aggiegation. Then the fusion protein was recovered by centrifuging at 2,500qnn at 
42^ for 3min. Samples were run in a 16.5% Tricine ge]» transferred to the nitrocettulose 

20 membrane, and imnnmoMotting was perfiMxaed. When the sonic extract is in 6M Guanidine 
Hydroddoride Phoqihate Buffer, pH 7.0, the molecular weight changes &om its original and 
ccnrect MW 24 kD to a higher MW of approximately 30 kDa (Figure 9A3). This is probably 
due to the conformation of the biopolymer m diis buffer. 

25 The gel was fiist stained widi 0.3M CuC32 and dien die same g^l was stained wxdi 

Commassie Rr250 Statning Sohition for an hour and dien destained for ISmin firs^ and then 
ovendg^ CUCI2 cieates a negative stain (Lee et aL 1987). Polymer proteins (without fusion) 
dupgessi as dear bands against a blue backgjxmnd in color or dark against a light semiopaque 
badEground (Figure 9A). This stain was used because odier protem stains such as Coomasste 

30 Blue R250 does not stain the polymer protein due to the lack of aromadc side chains (McPherson 
. et aL, 1992). Therefore^ die observadon of die 24 kDa protein in R250 stamed gel (Hgure 9B) is 
due to the insulin fusion with the polymer. This observation was fiirdier confirmed by probing 
these blots with the anti-^iuman i»roinsulin antibody. As anticipated, the polymer insulin fusion 
im>tein was observed in western blots (Figure lOA^). Larger proteins observed (Figure lOA-C) 

35 are tetramer and hexamer contplexes of proinsuhiL It is evid^t that the insuhn-polymer fusion 
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ble iB ExoU. Confixming this observatioii, recently snofher lab has shown that the 
PBP polymer protein conjugates (with thioiedoxin and tendamistat) undergo thermally reversible 
phase transition, retaining the transition bdmvior of the &ee polymer (M ^er and Chilkoti, 
1999). These results clearly demonstrate that insnlfai fnsion has not affected the inverse 
5 temperature transition pnqperty of tiie pirfymer. One of ^ concems is the stability of insulin 
at tmqperatmes used for fhennally leversibb purification. Tenqieratore induced production of 
hnnianmsu]inhasbee»inoomnieridaluse(Schmi 1999). Also» &e tnuperature transition 
can be lowered by incieasiiig the ionic stcei^ of the sofaitiQa during purificaticm of this PBP 
(McPherson et aL 199Q. Thus, GVGfVP-fnaion could be used to purify a nmltitude of 
1 0 ec(»iamicalfy inqxxrtant pntoins in a sirr^ie inesqiensive step. 

Biopolymer-^roinsulin fusion expression in chioroplast: As described in section d, 
chloroplast vector was bombarded into the tobacco chioroplast genome via particle 
bombardmoit (I>anicll» 1997). PGR and Southcm Blots were performed to confirm biopolymer- 
15 proinsttlin fosion gene integration into chioroplast genome. Soutiiem blots show homoplasmy in 
most To lines but a few showed some hetetoplasmy (Figure 1 1). Western blots show the 
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polymer pioinsulin fbsioii proftem in all transgenic lines (Figine IOC). 
Qnandfication using EUSA is in progress. 

Protease Xa Digestion of the Biopolymer-prolnsulin ftision protdn and Porification of 
5 Proinsiilin: The en^madc cleavage of the fiision protein to release the proinsulin protein team 
\hs (GV GVP)4o was initiated bfy adding die feckor 1 OA protease to the poiified fasion protdn at a 
ratio (wAv) of appr o xim a tely 1:500. CHeavag^ of die fusion protein was monitored by SDS- 
^ PAGE aDBiysaa. We detected cleaved proinsufin in the extracts isolated in 6M goanidtne 
hydrochlonde bufEer (Figpro IDA,?)* Conditions are now being optimized for eomphto 
10 deavage. The Xa protease has been soccessfiiUy used pievioiisly to cleave (GVGVP)2(rGST 
fusion (McPhnsonetaL 1992). 

d. RESEARCH DESIGN AND METHCM)S 
Evaluation of cfaloroplast gene expression; A systematic approach to identify and OfVcicQme 

15 potootial limitations of foreign gene expression in chloroplasts of transgenic plants is essential. 
Lnfonnation gained in this study should increase the utility of cfaloroplast transfixmadon system 
by scientists interested in eoqiressing other foreign proteins. Thmefore^ it is inqmrtant to 
s/stemadcaDy analyze transcr^tion, RNA abundance, KNA stability; rate of protnn synthesis 
and degiadadan, proper folding and biological activity. For exarapl% the rate of tianscripti<»i of 

20 the introduced insuhn graie will be CQmpared with the higldy expressing endogenous chloroplast 
gqoes (fbcL, psbA, 16S rBNAX using run on transcription assays to determine if die 16SrKNA 
promoter is operating as expected. Transgenic chlorq)last containing each of the dnee constructs 
widi different 5* r^jons will be investigated to test their transcrq>tion efGciency. Similar^, 
tiansgene RNA levels will be monitored by nordiems» dot blots and primer ^deimon relative to 

25 endograious rbcl^ 16S ifiNA» or pshA. These results along with lun on transcription ass^ 
should provide valuable infomiation of RNA stability^ processings etc. Widi our past expcneoce 
in expTessi0n of seveial foreign genes, focdga transodpts teppear to be extremely stable based 
on northern blot analyst However, a systematic stu^ would be valuable to advance utility of 
this system by other sciendsts. Most importandy, die efiBdmy of transladcm will be tested in 

30 isolated chloroplasts and compared widi die highly translated chloroplast protein (pshA). Pulse 
chase experiments would help assess if translatianal pausing, pTematme terminatian occurs. 
Evaluation of percent RNA loaded on polysomes or in constructs widi or without STJTRs would 
help determine the ef&ciency of the nbosome binding site and 5* stem-lo(^ transladonal 
enhancers. Codon optimized genes will also be casaparod with unmodified genes to investigate 

35 the rate of translation, pausing and terminaticm. In our recent e:q>erience, we observed a 200-fold 
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tccumolatioii of foieiga proteins doe to decreases in pioteolysis confeoed by a 
putative cbapejcomn (De Cosa et aL 2001). Therefore^ proteins firom constructs e^sressing or not 
ejqsressing the putative chaperonin (with or wi&out ORFl+2) should provide vahiable 
infonnation on protein stability. Thus, all of this information win be used to improve the next 
, 5 generatian of chloroplast vectors. The PI has extensive e^qperience in analysis of chloioplast gene 
expression (Relevent publications are included in resnme). 

Optimizatioii of gene expresston: We have reported that foreign genes are expressed between 
5% {erylAal) and 46% (c9y2Aa2 opeiQn) in transgenic cfaknoplasts ^ta et al. 1999; De Cosa 

10 ^ aL 2001), Several approaches will be used to enhance tcanslalinn of the recomtbinant proteins. 
InchloroplastSytrBnscrq^onaliegiilationasabotde-iie^ expression has been overcome 

by ut^liT^g tiie strong constiluitive promoter of the 16s rRNA (Ptm). One advantage of Prm is 
that it is recognized by bo& &e chloroplast encoded RNA polymerase and the imclear encoded 
chloroplast RNA. polymerase in tobacco (Allison et aL 1996). Sevoral investigators have utilized 

15 Prm in their studies to overcome the initial bunOe of gene e:q)ression, transcrqytiQn (De Cosa et 
aL 2001, Eibl et aL 1999, Staub et al. 2000)i RNA stability appears to be one emoag the least 
probleEms because of observatiDn of excessive aocmnulatian of foieigii transczqyts» at times 
16^66-foId higher than tiie faig^ expressmg iniclear transgenic plants (Lee et al. 2000). Also, 
other investigati(»is r^arding RNA stalnlity in chloroplasts suggest that efforts fbr optimizing 

20 ffne esqnessicHi need to be addressed at the post-transcriptional level (Higgs et al. 1 999» Eibl et 
al. 1999). We intend to focus our investigation to address protein expression post- 
transcr^onally. For example, 5* and 3^ UTRs are necessary fi»r optimal translation and mRNA 
stablilit/ of chlor(^last mRNAs (Zerges 2000). Optimal ribosomal binding sites (RBS's) as well 
as a stem-loop structure located 5* adjacent to RBS are required for efficient translatioa A 

25 recent study has sbxrm that replacement of tiie Stain&*Delg?mo (GGAGG) with ttie psbA S' 
UTR. downstream of fbe 16S rRNA promoter faihanrc^rt translation of a fbieign gene (GfUS) 
faundred-fiold (JBSbl el aL 1 999). Therefore, tiie 200-bp tobacco cUon^last DNA figment ( 1 680- 
1480) oontaimpg 5* psbA Unt will be used. This PCR product will be insorted downstream of 
the 16S rRNA piomotcr to enhance transladcm of the recomhiiiant pr^ 

30 

Yet another approach for enhancement of translation would be to qotimiage codon 
compositions. We hove compared A+T% content of all foreign gienes that had been expressed in 
transgenic chlorc^lasts in our laboratory witii the parcenta^ of chlorc^last e;q>ression. We 
found that hig^ levds of A+T always corr^ated with high egression levels (see table 2). It is 
35 also potentially possible to modify chlorofdast protease recognition sites while modifying 
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it affecting their biological fimctions. TherBfoie^ optimizing codon compositions 
of insulin and potymer genes to match die psbA gene should enhance its leyel of translation. 
Although tbcL (RuBisCO) is the most abrmdant protein on earth, it is not translated as highly as 
the psbA gqne due to the extremely high turnover of the psbA gene product. The psbA gene is 
5 under stconger sdecttoa for increased translation efficiency and is the most abundant thylakoid 
parotein. In addition^ the codon usage in higher plant chloroplasts is biased towards the NNC 
codon of 2-fold d^enerate groups (Le. TTC over TTT, GAC over GAT, CAC over CAT, AAC 
over AAT, ATC over ATT, ATA etc.). This is in addxtion to a strong bias towards T at third 
position of 4-fbld degenraate groiqis, Utere is also a context effect that shouki be taken into 
10 cooadmtion while modiiying specific codons. The 2-&]d degenerate sites immediately 
iq>stream from a GNN codon do not show this bias towards NNC. (TTT GGA is preferred to 
TTC GGA while TTC COT is preferred to TIT CGT, TTC ACT to TTT A 
TTT TCT, Morton, 1993; Morton and Bemadette, 2000). In addition, highly wqjiessed 
chloioplast genes use more &equentiy diat other genc& The wdb site 

15 ht<p://wWwJcaZU3a,OT>jp/codon and http-y/wWW Tirl^' nfm nfh,pnv tv> nmH tfi ftptfmiT** orwtnn 

. compositk>n by comparing codon usage of dSaent plant species' gencmies and PsbA^s genes. 
Abundance of amino acids in ddoroplasts and 3Q<IA anticodons ]»esent in chloroplast will be 
taken irito coQsideiation. Optimizatioa of potpner a^ 

approach (ProdromoQ and Pearl, 1992; Casimiro et aL 1997), which has been successfully used 
20 m our lalM>ratory to (^itimize codon oorrq)osition of other hnxi^ 

Vector constructions: For all the constructs pLD vector will be used. This vector was developed 
in ibis laboratory jfor chloioplast transformation. It contains the V6S iRNA promcto (Prm) 
driving the selectable maxker gene aadA (aminog]tycoside adenyl transferase conferring 

25 resstance to qMcdnomycin) followed by the mnltqi^le cloning ate and diea tfaepis&l 3* r^^m 
(the terminator from a gene coding for photosystem II reaction center conq)onents) fiom the 
tobacco chloroplast genome. The pLD vector is a universal chloroplast expression integration 
vector and can be used to transform chloEC^last genomes of several oflker plant species (Daniell 
et al. 1998, Daniell 1999) because these flanking sequences are highly conserved among M^ier 
- 30 plants. The universal vector uses tmA and tml genes (chloroplast transfer RNAs coding for 
Alanine and Isolraichie) fiom the inverted rcpest region of the tobacco chloroplast genome as 
flanking sequences for homologous recombination. Because the universal vector integrates 
foreign g^es within the Inverted Repeat region of the chlon^last genomc> it should double the 
copy number of &e transgene (from 5000 to 10,000 copies per cell in tobacco). Furth^oie, it 

35 has been demonstrated that homoplasmy is achieved even in the first round of selection in 
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bly because of the piesoice of a chloroplast ocigm of rq>lication wilfain the 
flanking sequ^ice in tbe univexsa] vector (thei^ providing more templates for integzation). 
These, and several other reasons, foreign gpa^ e^qpression was shown to be much higher when 
the imiversal vector was used instead of the tobacco specific vector (Guda et aL 2000). 

5 

CTB-Pn^bisalin Vector Construction: The cUoropiast expressicD vector pLD-CTB-Prcnns will 
be constructed as foUowsi First; bodi prcnnsolin and cholera toxm B-subunit genes woe 
anq>liGed fiom suitable DNA. usbg pnmer sequences. Fthner 1 will contain flie GGAGO 
chloroplast preferred ribosome binding site five nucleotides i^pstream of the start codon (ATG) 

10 forthe CTB gene and a suitable lestdction enzyme site (Spel) fi»r insertion into fb& chloroplast 
vector. Pttmer 2 will eliminate fte stop codon and add the first two amino acids of a flexible 
hings tBtrapq)tide GPGP as reported by BergeiDt et aL (1997), hi order to fedlitate folding of tiie 
CTB-proinsulin fiision protein. Primer 3 will add tiie Fcmaimng two amino acids for the hinge 
tetra-peptide and eliminate the prc^scquence of the native prc-proinsulm. Ptimex 4 will add a 

15 suitable r6Stricti(»i site (Spel) for subclcuiing into the chloroplast vector. Anq>lified PGR 
products will be mserted into &b TA clomng vector. BoOi the CTB and pcomsulin PGR 
ficagments will be excised at the Smal and Xbal restdction sites. Ehited fragments will be ligated 
into the TA cloning vector. The CTB-^^roinsulin fiagm^ will be ^dsed attheEcoRI sites and 
inserted into EcoiRI digested dephosphorolated pUD vector. 



20 



We will design the following vectors to opthnize protein expression, puiificaticHi and production 
CI*»<»tet8enora. im^ i r;5r b[toA 

Potential Constnidsi H^ ) ! CTB . J Pite % or | (6y6VB)j Pris ^ 

li^II ^ 1 PtPrfe :| \> or f (GVGyPJd MPii^ 



amgS ^ CTB I ptPris or \ {.GS/<3,Wiii 



of inoteiiis widi ihc same amino add cooqiosition as in human insulin 
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icco plants^ Eibl (1999) demonstrated, in vivo, the dafferences in transiatioii 
efBci^icy and mRNA stability of a GUS leporter gene due to various 5' and 3' untranslated 
regions (UTR's). We intend to implement this alrea^ described sy^ematic transcription and 
translation analysis in our practical endeavor of insulin production. Consistent with Bibles 
5 (1999) data fiirincieased tianslatkm efGdem^ and mRNA stability, we will use the psbA 5* 
UIR in addition with the pdsA 3'UTR aheady in use. 

fragmeat containing 5* pshA UTR will be an4)li&d by PCR usmg tobacco chloroplast DNA 
as templBto. This fiagment will be dcned directly in the pLD vector nmltqxle dozung site 
downstream of die promoter and the aadA gene. The cloned sequence wiH be ezacdy the 
10 sameasintfaepsbAgpie. 

b) Anotiier approach of protein prodoctian in chloropla^ involves potential insulin 
crystallization for Militating purification. The aylAal BadUus thuringiensis operon 
derived putative chapcronin will be used. Expression of the cry2Aa2 openm in chloroplasts 

15 provMes a modd system for fayper-expnession of foreign proteins (46% of total soluble 
protein) in a folded configmation enhancing Ifaeir stabiHly and fedlitating purification (De 
Cosa et aL 200 1 ). This justifies inclusion of the putative chaperonin fiom the ay2Aa2 Cfgmik 
in one of the newly dpaagned canstiucts. In this region tiiere ace two open reading fiames 
(ORFi and ORP2) and a ribosomal bindmg site (rbs). This sequence contains elements 

20 necessary for Cry2Aa2 crystallization, which may help to crystallize insulin and aid in 
subsequent purificadon. Successful crystallization of other proteins \ising this putative 
chaperonin has been demonstrated (Ge et aL 1998). We will amplify the ORFI and ORF2 of 
tlie Bt OylAaZ operon by PCR using the complete openm as template. Subsequent clcming» 
using a novel PCR technique, will allow for direct fusion of this sequence immediately 

25 upstream of the proinsulin fusion protein withmit altering the rmcleotide sequence, which is 
n0cna% necessary to provide a cestrictioaena^nie site (Bb^^ 1988). 

c) Tft address codoit op timiTatimi the prnmsnKn gene will he iailijfirt In a mrtatn mnrHftrytioyiff 
in subsequent constructs Tte jdastid modified proinsulin (PiPris) wiD have its nucleotide 

30 sequence modified such that the codons are optimized for plastid e;q)re£^Qn, yet its amino 
add sequence will remain identical to human proinsuliiL PtPris is an ideal substitute fot 
human proinsulin in the CTB fusion peptide. We intend to compare tiie expression of this 
construct to the native human proinsulin to determine tiie a&cts to codon optimization, 
which will serve to address, in a case study format, one relevant mechanistic parameter of 

35 translation. Analysis of human prraisnlin gene showed that 48 of its 87 codons wexe the 
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[uency codons in the dilacoplast for the amino acid foT which thi^ encode. For 
exanq>le» there aie six different codons for leucine. Their frequency witiiin tiie chloioplast 
genome ranges from 7.3 to 30.8 per thousand codons. There are 12 leucines in proinsulin, 8 
have the lowest firequency codons (7.3), and none code for the highest frequency codons 
5 (30.8). In the plastid optimized pioinsulin gene all Ihe codons will code for the most frequent, 
whereas in faonoan pnnnsulm over half of tiie codons ate Hic least fiequeot Human proinsulin 
nucleotide sequence contains 62% C+G, \irtieieas plastid optimized pioinsulin gene will 
contain 24% C-fG. GensxaiSfy, lower CHj content of foreign genes conelates with h^^ 
levels of eaqpression (Table 2). 

10 

d) Another version of the proinsulin gene, inini-proiiisiilin (Mpris), will also have its codons 
optimized for plastid expres^on, and its amino acid sequence will not differ fom human 
proinsulin (Pris). Pris' sequoice is B Oiain-RR-C C]lhain-KR-A Chain, whereas MPris* 
sequence is B Chain^:R-A ChaixL The MPris sequoice excludes the RR-C Chain, which is 

15 normally excised in proinsulin maturation to insulin. Hie C chain of proinsulin is an 
unnecessary part of in vitro production of insulin. FEoinsoHn folds properly and forms die 
q>propriate di.<ai1fide bonds in the absence of Ac C chain The TeTnaTrnngCT mptif fhat CTSts 
between the B chain and the A chain in MPris allows for mature insulin production upon 
cleavage with trypsm and caiboxypqptidase B. This construct will be used for our proposed 

20 biopolymer fiision protdn. If s oodon optimizatian and amino ^d sequence is ideal for 
mature insulin production, 

e) Our current human proinsolin-biopolymer fusion ^otein contains a &ctor Xa proteol^c cut 
site, whidi saves as a cleavage point between fbe biopolymer and the proinsulin. Onrendy, 

25 cleavage of the polymer-proinsulin fiision protein with tibe &ctor Xa has been f?!^ffi(^>nt in 
our bands. Therofore^ we will rqilace this cut site witii a trypsin cut site, lliis will eliminate 
the need for the eaqpensive &ctor Xa in processing proinsulin. Since proinsulin is canmHy 
processed by trypan in the finmation of mature msulin» insulin maturation and fusion peptide 
clean^age can be achieved in a singjb step with trypsin and carbcncyp^ 

30 

f) We observed incomplete traiislation products in plastids when we e?q)ressed the 12(hrierge^ 
(Ouda et aL 2000). Tbeiefore^ wMe expressing the polymer-proinsulin fusion protein, we 
have decreased the length of the polymer protein to 40mer, without losing the thermal 
reqponsive pn^>erty. In addition, optimal codoiis for glycme (GGT) and valine (GTAX which 

35 oonstitutB 80% of the total amino acids of the polymer, have been used. In all nuclear 
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snes, gj^cme makes up 147/1000 amino adds wbilc in tobacco diloioplasts it is 
129/1000. Highly e^>ressmg genes like psbA and zbcL of tobacco make up 192 and 190 
gly/1000. Therefore, ^ycine may not be a limiting fector. Nuclear genes use 52/1000 proline 
as opposed to 42/1000 in chloroplasts. However, cunently used codon for proline (CCXj) 
5 could be modified to CCA or CCT to fiirdier enhance translation. It is known that pathways 
for proline and valine are conq>aitmentalized in chloroplasts (Ouda et al. 2000). Also, ph>Iine 
is known to acnmrnlatfi in chloroplasts as an osmoprotectant (DanieD et aL 1994). 

g) Codon CGiqpazisQD of the CIB gene with p^A, showed 47% homology with the most 
1 0 fiequent codons of tiie psbA gena Codon analysis showed that 34% of fbe codons of CTB 
are con^limentary to the tRNA pq;ni]ation in the chloroplasts in comparison widi 51% of 
psbA codons that are cnmplimentaiy to the chloroplast tRNA population. Because of the high 
levels of CI B e^qiression in transgenic cfaknoplasts (HemiquBs and Daniell, 2000X there will 
be no need to modify the CIB g^. 

15 

DNA sequence of all constcucts will be determined to confirm the correct orientation of 
genes» in fiazne fasion, and accurate sequences in the recombinant DNA constcucts. DNA 
sequencing will be done using a Peikin Ebner ABI prism 373 DNA seqnendng system using a 
ABE Prism Dye Terminatian Cyde Sequencing kit By usiqg pdmexs for each strand^ insertion 
20 sites at both ends will be sequenced 

Because of &e similarity of protein synthetic machinery (Bnxsy et al. 1997X expression 
of an chlorq[>last vectors will be first tested in E.coH before tiieir use in tobacco transformation. 
For Escherichia coli expression XL-1 Blue strain was used. K coii will be transfbimed by 
25 standard CaCl2niethod 

Bombardment and Regencnthm of ChloroplasI IVansgenic Plants: Tobacco {Nicotiana 
tabacum var. Petit Havana) and nicothie fiee edible tobacco (LAMD 60S» gift firom Dr. Eleith 
Wyoof^ Planet Biotecimology) plants wiU be grown asqytically by germmation of seeds cm MSO 
30 medium (Darnell 1993). Ful^ expanded, dark green leaves of about two month old plants will be 
used for bombardmenL 

Leaves will be placed abaxial side up on a Whatman No. 1 fiit^ paper laying on the 
RMOP medium (Danielle 1993)'in standard petri plates (100x15 mm) for bombardment Crold 
35 (0.6 pm) microprojebtiles will be coated with plasmid DNA (chknoplast vectors) and 
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will be earned out with tixe biolistic device PDSIOOO/He CBio-Rad) as described 
by Daniell (1997). Following bombardmeiit^ petri plates will be sealed wifli parafibn and 
Incubated at 24^C under 12 h photoperiod. Two days after bombardment, leaves wiH be chopped 
into small pieces of ^^5 mm^ in size and placed on the selection medium (RMOP containing 500 
5 jjg^ml of spectinomycin dihydrochloride) with abaxial side touching the medium in deq> 
(lCXlx25 mm) petri plates (~10 pieces perplate). The regenerated q)ectinomycin resistant shoots 
will be chopped itito small pieces (-2mm^) and sabclaned into fiesh deep petn plates (~5 pieces 
per plate) containing Ihe same selectian medium. Resistant shoots fimn die second cuttore cycle 
wiU be transifenped to the rxx>tmgn:xedimn (MSG medium su^ 1 mg^iterand 

10 specdnonrycin dili^diochloridey 500 mgfliter). Rooted plants will bo tranafened to soil and 
grown at 26°C under c<mtinuous lighting conditions for fiirtfaer analysis. 

Polymerase Chain Reaction: PGR will be done using DNA isolated from control and tEansgenic 
plants in order to distingnd^ a) trae dilon^last transformants from nmtants and b) ddoroplast 

1 5 transfomiants fiom nuclear tcansfinmanls. Primas for testing the presence of the aadA gene (that 
confers spectinomycin resistance) in transgenic plants will be landed on the aadA coding 
secpience and 16S dtNA giene (primers IP&IM,). In order to test chloiqplast integratioQ of the 
insulin gsno^ one primer will land on die aadA gene while another will land on the native 
cbloioplast grauxme (primers 3P&3M). No PGR product will be obtained with nuclear transgenic 

20 fdaots usiog this set of primers. The primer set (2P & 2M) will be used to test integration of the 
entire gene cassette without any mtemal deletion or locking out during homologous 
recombination, by landing on the req)ective recombination sites. A Similar strategy has been 
used successfully by us to confiim chloropla^ integration of foreign genes (Daniell et aL, 1998; 
Kota et aL» 1999; Guda et aL, 2000X This screening is essential to eliminate nmtants and nuclear 

25 transformants^ In order to conduct PGR anafyses in transgenic plants^ total DNA fiDom 
unbombarded and transgenic plants will be isolated as described by Edwards et al (1991). 
Ghloroplast transgenic plants containing die proinsulin gene will be moved to second round of 
selection in order to achieve homoplasmy. 

30 Sonthem Blot Analysis: Southern blots will be done to determine the copy number of the 
introduced foreign gene per cell as well as to test homoplasmy. There are several thousand 
copies of the chloroplast genome present in each plant cell. Therefore^ when foreign genes are 
inserted into the chloroplast genome, it ia possible that scnne of the chloroplast genomes have 
foreign genes int^;rated while others remain as the wild type (heteroplasmy). Therefore, in order 

35 to ensure that only die transformed genome exists in cells of transgenic plants Qiomoplasmy), the 
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ss will be CQntmned In order to con&m that flie wild type genome does not exist 
at the end of the selection cycle, total DNA firom transgenic plants should be probed wi& the 
chloroplast bordo: (flanking) seqoences (the tml-tmA fiagmoot, Figure 2A3B). If wild type 
genomes are present ^eterq)lasniy), the native fi:agnient size will be observed along with 
5 tiansfiinned genomes. Presence of a laigp fiagment (due to insertion of foreiga genes within the 
Oariking sequences) and absence of the native small fiagment should confirm bomoplasmy 
. (DanieU et al., 1998; Kota et aL, 1999; Ooda et aL, 2000). 

The copy number of tiie int^prated gene will be detmnined by establishing homoplasmy 
10 for the transonic chlcxroplast genome. Tobacco Chloroplasts contain 5000-10,000 copies of 
their genome per cell (DanieU et aL 1998). If only a faction of the genomes are actually 
transformed, the copy mimber, by de&ult, nm^ be less than 10,000. By establishing that in the 
transgenics the insulin inserted transformed genome is the only one present, one could establish 
that the copy numbo: is 5000-10»000 per cell This is usually done by digesting the total DNA 
15 with a suitable restriction enzyme and probing with the flanking sequences that , enable 
homologous recombination into the chloroplast genome. The native fragment present in the 
control should be absent in &e tran^enics. Hie absence of native fragment proves that only the 
transgenic chloroplast genome is present in the cell and there is no native, untransformed, 
chloroplast gBnorne^witiioiit the iiisullngiene present This establishes the hamoplasmic nature of 
20 our transfcHiiianls, smmllaneously providing us widi an estunate of 5000^10,000 copies of the 
foreign genes per celL 

Northern Blot Analysis: Northern blots wiQ be done to test the ef&ciency of tcanscr^tion of the 
proinsolxn gene fused widi CTB or po^er genes. Total RNA will be isolated fiom 150 mg of 

25 &02m leaves by using the "lUxeasy Plant Total RNA Isolation Kir (Qiagra Inc., Ghatswortfa, 
CA). RNA (10^ pg) wiO be denatured by formalddiyde treatment separated on a 12% 
agarose gel in the presence of finmalddiyde and transfored to a nitrocelhilose mfanfarane (MSQ 
as described in SairibiOQk et aL (1989X Probe DNA (proinsulin gene codmg region) will be 
labeled by ^ candom^piirned medmd (Ptomega) with ^-dCTP isotope^ The blot will be pre^ 

30 hybridized, hyfaddized and washed as described above for southern blot analysis. Transcript 
levels will be quanti&d by ^ Molecular Analyst Program using the GS-700 TmagiTig 
Densitometer (Bio-Rad» Hercules, CA). 

Pdymer-insalin fusion protan pnrificat^0I^ quantitation and chafaeterization: 
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ler insulin fosion proteins exhibit inverse ten^erature transition properties (Piguie 
9 azid 10)» they will be pmified firom transgenic plants essentially following tiie same method 
recently described by us for polymer pmification fiom transgenic tobacco plants (Zhang et 
al.,1996). Polymer extraction bufifer contains 50 mM Tris-HCl, pH 7.5, 1% l-mecaptoetiianol, 
5 5niM EOTA and 2mM PMSF and 0.8 M NaQ. The homog^iate will then be coatiifb^ at 
10,000 g for 10 minutes (4°C)» and the pellet will be discarded The sc^ematant will be 
incubated at 42°C for 30 minutes and then centri&ged immediately fbi 3 minutes at 5»000 g 
(room temperature). If insulin is found to be sensitive to this temperature^ Tt will be loweied by 
increasing salt concentration (McPherson etaL, 1 996), The pellet containing the insulin-polymer 

10 ftooii protein will be lesnqpended in the cxtracticm buffer and incubated on ice for lOnunutes. 
The mixture will be centiifuged at 12»000 g for 10 minutes (4°C). The supernatant will be 
collected and stored at -20^C. The purified polymer insulin fusion-iarotein will be 
electrt^horesed in a SDS-PAGE gel according to Laemmli (1970) and visuafized by either 
staining with 03 MCuClz (Lee ^al. 1987) ortransfen:ed to nxtrocelhilosemcinbraite and probed 

15 m6k antiserum raised against the polymer or insulin protem as described below. (Quantification 
of purified polymer protons will be canied out by EliSA in addition to densttometcy. 

After electiophoresisiy proteins will be transferred to a nitrocellulose membrane 
electrophoceticalty in 25 mM Tris» 192 mM glycine, 5% m^ianol (pH 83). The filter will be 

20 blodced with 2% dry milk in Tns-bu£fered saline for two hours at room tempecatnre and stained 
with antiserum raised against the polymer AVGVP (kindly provided by the University of 
Alabama at Birmingham, monoclonal &cili1y) ovemig^ in 2% dry milk/Tris buffered saline. 
The protein bands reacting to &e antibodies will be visualized «sing alkaline phosphatase-linked 
secondary antibody and the substrates nitrobhie tetrazoHum and 5-bromo-4-<^ilon>-3-indoIylr 

25 phosphate (Bio-Rad). Attematively, for insuhn^lymer fiision proteins, a Mouse anti-human 
pnnnsnlin (IgGl) nionoelQnal aiitibocfy will be used as a primary antibocfy. To detect tihe tmufing 
of tlie primary antibody to tibie recombinant pnunsulin, a Goat anti-mouse IgG Horseradish 
Peroxidase Labeled monoclonal antibody (HPR) will be used The substrate to be used for 
conjugation vritfa HRP will be 33*» S^'-Tetramethylbenzidine. All products will be purdiased 

30 fiom American Qualex Antibodies in San Clemente, CA. As a positive control, human 
recombinant proinsulin fiom Sigma wiU be used. This human recombinant proinsulin was 
expre^ed in E,coli by a synthetic proinsulin gene. Quantification of purified polymer fiision 
{Hroteins will be carried out by densitometry using Scanning Analysis software (BioSofi, 
Ferguson, MO). Total protein contents win be determined by the dtye-binding assay using 

35 reagents siq^lied in MtfitmiBio4(a4 with boviiie serum albumin as a St 
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Characterizatkm of CTB eq>ressioii: CTB protem levels in transgenic plant cnide extract will 
be deteinmednsing quantitative ELISA assays. A standaxdcurve willbe generated using known 
concentrations of bactmal CTB. A 96-well miax>titer plate loaded with lOOpl/well of bacterial 
5 CTB (concentrations in tiie range of l&-100Qng) will be incubated ovemig^ at 4^C. The plate 
will be- washed dmce with PBST phosphate bufTered saline containing 0.05% Tween-20). The 
background will be blocked by incubation in 1% bovine serum albunun (BSA) in PBS 
(300)il/well) at 37^C for 2 h followed by washing 3 times with PBST. The plate will be incubated 
in a 1:8,000 dihition of rabbit anti-cholera toxin antibody (Sigma C-3062) (lOOplMell) for 2 h at 

10 3 /C, followed by washing tiie wells tinee times with PBST. The plate will be mcubated with a 
1:80,000 dilution of anti-iabbit IgG conjugated witii alkaline phoshatase (lOOplAvell) for 2 h at 
37^C and washed ^ce witfi PBST. Then, 100 pi alkaline phosphatase substrate (Sigma Fast p- 
nitrophenyl phosphate tablet in S inl of water win be added and the reaction wiU 
IM NaOH (SOpl/well) when ateorbandes in the mid-range of the titration reach about 2.0, or 

15 after 1 hour, ^chever comes first. The plate will then be read at 40Snm. These results will be 
used to generate a standard curve fiom vAsioh concentrations of plant protein can be extrapolated. 
Thus» total soluble plant protdn (concentration previously detennined using tiie Bradford assay) 
in bicarbonate buffer, pH 9.6 (ISmM NaaCos, 3SinM NaHCOa) will be loaded at 100 plant 
pl/well and the same procedure as above can be repeated. Hie absorbance values will be used to 

20 detemoine flie ratio of CTB protein to total soluble plant protein, using the standard curve 
generated previously and tfie Bradford assay results. 



biheritance of Introduced For^;a Genes: While it is unlikely Usat introduced DNA vrauld 
move fiom the chloroplast ^ome to nuclear genome, it is possible that the gene could get 

25 integrated in foe miclear gaxcnne daring bombardment and remain undetected in Southern 
analysis. Therefore, in initial tobacco transformants, some will be allowed to self-poUinate, 
whereas others will be used in recipiocal crosses wifo control tobacco (transgenics as female 
accq>ters and pollen donors; testing for maternal inheritance). Harvested seeds (Tl) will be 
germinated on media containing qiectinomycin.. Achievement of homoplasmy and mode of 

30 inheritance can be classified by looking at germination results. Homoplasmy should be indicated 
by totally green seedlings (E>aniell et aL, 1998) while heteroplasmy is displayed by variegated 
leaves (lack of pigmentation, Svab & Maliga, 1993). Lack of variation in chlorophyll 
pigmentation among progeny should also underscore the absence of position effect, an arti&ct of 
nuclear transformation. Mateanal inheritance will be demonstrated by sole transmission of 

35 introduced genes via seed generated on transgenic plants, regardless of pollen source (green 



wo 01/72959- 



222 



PCTAJSOl/06288 



elective media). When transgenic pollen is used fixr polHnation of control plants, 
resoltant progeny would not contain resistance to chemical in selective media (will appear 
bleached; Svab and Mahga, 1993). Molecular analyses will confirm transmission and expression 
of introduced genes, and T2 seed will be generated Scorn those confirmed plants by the analyses 
5 described above. 

Comparison of Cnrrait Pvrificatioii wifli Polymer-based Pariflcation Methods: It is 
important to cmxpaie purification mettxxb by testing yield and purity of insnhn produced in 
EmU and tobacco. IhrBe methods wiU be compared: a standard fusion protein in K coU^ 

1 0 polymer proinsolin fadoa protein in E coUy and polymer prainsulm fiisioa in tobacco. Polymer 
proinsulin fusion peptide firom transgenic tobacco will be purified by mdhodology described in 
section c) and Daniell (1997). K co/i pmificatton will be performed as follows. One liter of each 
pLD containing bacteria will be grown in LB/an^iidllin (100 ^g/ml) ovemig^ and the fusicHi 
protein, either polymer-proinsulin or the control fusion protein (Cowl^ and Mackm 1997), will 

15 be expressed Cells will be harvested by centrifiigation at 5000 X g for 10 min at and the 
bacterial peUets will be resuspended in 5 ml/g (wet wt Bacteria) of 100 mM Tris-HCl, pH 73. 
Lysozyme will be added at a ccmcenlration of 1 mg^ml and placed on a rotating dialcer at room 
temper atu re for 15 min. The lysate will be subjected to probe sonication for two cycles of 30 s 
Gii/30 s off at 4^CX CeUular debris will be removed by centrifngation at 1 000 X g for 5 min at 

20 4^C The B. coli pxodoced proinsnhn polymer fusion protein will be purified by inverse 
tranperature transition properties (Daniell et aL» 1997). After Factor Xa cleavage (as described m 
section c)) the proinsulin will be isolated finm the potym^ using inverse teniperature transition 
properties (DaneiU et aL, 1997) subject to oxidative snifitolysis as described below. 
Altomativefy, the control fusion protein will be purified according to Covdey and Mackm (1997) 

25 as follows. The supernatant will be retained and centrifhged again at 27000 X g for 15 min at 
4^C to pellet the inchsum bodies. The supernatant will be discarded and the pellet resuspended 
in 1 ml/g (origtnal wt Bactma) of d^iO, aliquoted into micTDce n ttifuge tubes as i ml ftactions^ 
andthencentrifogedat 16000 Xg for S min at 4^C The pellets will be individually washed with 
1 ml of 100 mM Tris-HO, pH SJ, IM urea, 1-1 Ttiton X-100 and agam washed wiMOO mM 

30 Itis HCl pH8.S, 2 M urea, 2 % Itinton X-1 00. The pellets will be resuspended in 1 ml of dHaO 
and transferred to a pre^wei^bed 30 ml Cdrex centrifoge tube. The sanq)le will be.centrifuged at 
15000 X g for 5 min at 4^Q and the pellet will be resuspended m 10 ml/g (wet wt pellet) of 
70% formic acid Cyanogen bromide will be added to a final concentration of 400 mM and the 
sanqple will be incubated at room temperature in the dark for 16 h. The reaction will be stopped 

35 by trsnsforring the sanq>le to a round bottom flask and removing the solvent by rotary 
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50 The residue will be resuspended in 2Qm]/g (wet wt pellet) of dH20, shell 
602m in a diy ice ^hanol balh, and Htm lyopMiized The lyophilized protein will be dissolved 
in 20 ml/g (wet wt pellet) of 500 mM Tris-HCl, pH 8^ 7 M urea Oxidative sulfitolysis will be 
perfonned by adding sodium sulfite and sodium tetrathionate to final concentrations of 100 and 
5 10 mM, lespectively, and incubating at loom temperature for 3 h. This reaction will be stopped 
by fieezmg on dry ice. 

Pnrifieation and fioldiiig of Human Proinsulin: The f-solfanated matenal will be qiplied to a 
2 ml bed of Sqphadesc G-25 equilifarated in 20 mM Tds-HC^ 

10 with 9 vols of 7 M urea. The collected fiaction will be applied to a Phatmada Mom) Q HR5/5 
coiunm equilibrated in 20 mM Tns HCI, pH 8^ 7 M urea at a flow rate of 1 mUnnn. A Hnear 
gradient leading to final conceaitraticm of 0.5 M NaCl will be us^ to ehite die bound matedaL 2 
min (2 ml) fractions will be coUected during die gradient^ and protein concootradon in eadi 
fraction will be detemnned Purity and molecular mass of fractions will be estimated by Tricine 

15 SDS-PAGE (as shown in Fig. 2)» where Tricine is used as the trailing ion to allow better 
resolution of peptides in Ifae range of 1-1000 kDa. Appropriate fiactians will be pooled and 
^lied to a 1 .6 X 20 cm column of Sqihadex G-25 (superfine) equilifarated in 5 mM anmnanium 
acetate pH 6.8. The sample will be collected based 00 UV absorbance ai^ fieeze-dzied. The 
partially purified ig-snlfonated mataial will be resuspended m 50 mM gfycine/NaO^ pH 10.5 at 

20 a final concentration of 2 mg^mL pHuercaptoedianol will be added at a ratio of 1 .5 mol per mol 
of cysteine iS^-sulfonate and the saniple will be stirred at 4^C in an open coritarner for 16 h. The 
sample will be then analyzed by reversed-phase faig^perfrmnance liquid chromatography (RP- 
HPLC) using a Vydac C4 cohmm (Z^ X 150 mm) equilibrated in 4% acetonitrile and 0. 1 % TFA 
Adsorbed peptides will be ehitcd with a linear gradient of increasing ac^onxtriie concentration 

25 (0.88% per min iq> to a maximum of 48%). Hie remaining refolded pnxinsolin will be 
centrifiiged at 16000 X ^ to remove insoluble matnial, and loaded onto a semi-preparathre 
Vydac Q cotonm (10 X 250 mm). The bound material wiU be ehited as described alx^ 
proinsulin win be collected and lyopihilized. 

30 Analy^ and characteriaBation of insnltn eqiressed in K ctM and Tobacco: The purified 
ejqxiessed proinsulin will be subjected to matrix-assisted las^ desoipdonAonizatioQ-time of 
fli^ (MALDI-TOF) analysis (as described by Cowley and Maciin, 1997), using proinsulin 
from EH Lilly as both an internal and extraml standard. To detennine if the disulfide bridges 
have formed correct^ naturally inside (Moroplasts or by in \ntro processing, a proteolytic 

35 digestion will be petfinnaied using Staphylococcus aureus protease VS. Five [ig of both the 
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nsulin and Eli Lilly's promsalin will be lyophilized and resuspended in 50 |d of 
250 mM NaPO^, pH 7^. Protease V8 will be added at a ratio of 1:50 (wAv) in e?q>erimental 
samples and no enzyme will be added to the contiols. All samples will then be incubated 
ov^mi^ at 37^C, the reactions will be stopped by free2ing on dry ice, and samples will be 
5 stored at -20 nntO analyzed The sanq>le5 will be analyzed by RP-HPLC using a Vydac Q 
colunm {22 X ISO mm) equitibratBd in 4% ac^nitiile and 0.1% TFA. Bomid material will be 
elnted using a linear gradient of increasing acetonitrile concentrntioii (0.88% per min up to a 
maximnm of 48%). 

1 0 CTB^sMl gangBoside bindii^ assay: A GtAl -ELBSA assay will be penfomied as desoribed by 
Aiakawa et al (1997) to deteanine the alGBmfy of plant-denved CTB ftr GMl-gaoilioside. The 
midotiter plate will be coated with monosialogang^oside-GMl (Sigma G-7641) by incubating 
the plate with 100 pl/weU of GMl (3.0 ^ig^) in bicarbonate bufifer, pH 9.6 at 4 ovemi^ 
Alternatively, the wells will be coated with 100 jilAvell of BSA (3.0 pg/ml) as control. The plates 

1 5 win be incubated wi& tiansfonned plant total soluble protein and bacterial CTB (Sigma G-9903) 
in PBS (100 lilAvell) ovemigjht at 4^C The xesnaindor of the procedure will be identical to the 
EUSA described above. 

Indactioii of oral tideraice: Four wedc old female NOD mice will be purchased &om Jackson 
20 Labomtoiy (Bar BaAoT^ ME) and housed at the animal caze fedlity located in the school of 
Biology at fte University of Central Florida (UCF). The mice will be divided into three groups, 
each group consisting of ten mice. Eadi group will be fed one of the following nicotine &ee 
edible tobacco: imtransfbrmed, expressing CTB, or c^qoressing CTB-proinsulin fusion protein. 
Beginning at 5 weeks of agB» each mouse will be fed 3 g of nicotine free edible tobacco once per 
25 wedc mitil reaching 9 weeks of age (a total of five feedings). 

Antibody titor: At te& weeiks of age, the senim and fecal material will be ass^ed for anti-CTB 
and anti-proinsulin antibody isotypes using die EliSA method described above. 

30 Assessment of diabetic ^mptoms in £iOD mice: Hie incidence of diabedc symptoms will be 
compared among mice fed with control nicotine free edible tobacco diat expresses CTB and 
tiiose that express the CTB-proinsolin fusion proteiiL Starting at 10 weeks of age, the mice will 
be mrantoied on a biweekly basis widi urinary glucose test strips (Clinistix and Diastix, Bayer) 
for development of diabetes. Glycosuric mice wiU be bled firom the tail vein to check for 

35 glyoemia using a glucose analyzer (Accu^Sieok, Boehring^ Mannheim). Diabetes will be 
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Lyperglyc^iiia (>250 mg/dl) for two consecutive weeks (Ma et al. 1997). The plant 
tissue of control and tiansgiemc plants to be fed to mice will be provided for a coDaboiator (Dr. 
All Amiiidiosravi, Florida Hospital) to perfonn these studies. A letter of collabaratiGn is 
provided documenting this arraagement 

5 

Tratative Proposed Schedule 



Year I: 

a) Develop lecomhfnaitt DMA vectors for enhanced transMcm of pzoinsalin as fnsion protBui 
1 0 with protein based polymers or CTB via dilon^>last genomes of tobacco 

b) Obtain transgenic tobacco plants using the transformation vectors 

c) Assay transgenic esqxression of insulin-polymer fusion protrin and CIB in chlorqplasts nang 
molecular and biochemicril methods 

15 Yearn: 

d) Emplcy escisting methods of polymer pnrificatiim ficmi transgenic leaves or develop new 
^qpffoaches fixr die fusion protein and estimate levels of expression 

e) Analyze genetic conq>ositiOQ of transgenic plants (Mi»ideliaa or mat^nal inheritance) 

f) Large scale purification of insulin ficom green house grown transgenic plants and ccmqparison 
20 ofcmieminsulhi purification methods widipofyrner-lKi^ 

Yearm 

g) Refolding and diaiact^izatiQn (yield and puhty) of proinsulin produced in RcoU and 
transg^c tobacco 

25 h) Assessment of diabetic s ynipfaii ns in NOD mice fod wifit leaves eqviessing CIB-proinsolin 
fusion protein 

j) Assessment ofimmmieieqjoiise in niice fed wifli leaves egqnem^ 
k) Qmtinue to characterize subsequent transgenic genegatioPs(^ 
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H Vertelxate Animab 

Discrq^don of proposed work: Oral tolerance and the inddence of diabetic symptoms will be 
compaied among mice fed with nontransgemc tobacco (negative ccmtrol), CTB expressing 
5 nicodne free edible tobacco (LAMD 605) and those that e}q)iess the CTB-proinsulin fuaon 
protein. Unity, f^ale nonobese diabetic (NOD) mice, fonr weeks of agp, will be purchased 
from Jackson Laboratory (Bar Hadxxr, ME), and housed at the animal caie &cility located in the 
sdiool ofBiobgy at the Univecsity of Central Fioiida (UCP). 

10 Ezpttim^itel groiqps: Hie mice will be divided into the following groups^ each gtoup 
oonsistmgof tenmice: @coup fed nntiansfomied lAMD 6C^; g;coiq»2» fed transgeoic LAMD 
605 syntibesizmg CTB; and group 3, fed transgenic LAMD 605 syntliesizmg CTB-promsalin 
fusion protein. Begoming at frve weeks of ag^ each mouse will be fed 3 g of LAMD 605 once 
per weeknntil reaching 9 weeks of age (a total of five fiscdings). At ten wedcs, serum and fecal 

15 material will be assayed for anti-CIB and anti-insulin antibody isotyypes using ELtSA as 
described above. 

The incidence of diab^ syrnptoms will be compared among mice fed transgenic LAMD 605 
synthesizing CTB and LAMD 605 synthesizmg CTB-promsulin fusion protein. Starting at 10 
20 wedcs of age, die mice will be momtored on a biweekly ba^ with urinary ghicose test strqis 
(Clinistix and Diastix, Bayira) for development of diabetes. Glycosuric mice will be bled frtxn the 
tail vein to check for g^rcemia using a g^ose analyzer (Accu-Check, Boehnnger Mannheim). 
Diabetes will be confirmed by hypeiglyDemia 0^250 mg^dl) for two consecntive weeks (Ma et aL 
1997). 

25 

Inrv'estigaton The plant tissue of control and transg^c pla^ 

for a collaborator, AH Amoxkhosravi ^iD . (Florida HospitalX to pcxfimn diese studies. A letter of 
GollaboraticMi is provkied doarnienting diis arrangm^t Dr. Amiridiosravi's esqpertise for 
perfixming scioxtific investigatiaiis involving animals is demonstrated by his esqpedence and 
30 publications provided in his resume. 



Justification of species selection: Female NOD mice have a high incidence of developing 
autoiDumme diabetes after 12 wedis of age (Gaskins et al. 1992). Th^fore, they are the 
^ropriate model for our study for tiie prevention of autoimmune diab^es. According to the 
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previous oq^erieoce and Aiakffwa et aL (1998), ten mice for cadi group, tfairly 
mice total, will pxovkie our study with the amount necessary for reliable data. 

Vetemary care: Fand Tomson DVM is the veterinary consultant for the UCP animal care 
5 &a]ity. 

IMscomfort, distress, pain, and injury: According to the investigator's e3q)erience, the mice 
involved in this investigation will not experience discomfort or severe symptoms* inchidtng 
severe diabetic syn^toms. For&etmor^ die qiecLGc diet &r die mice is well toleroted Ardmals 
10 vnR be checked regolaxty; and in case of any visible signs of d^tress or pain, aniinals will be 
ranoved fiom die stu^» however this is nnlifcely. 

gnthanasia: I^n oompletiQn of the study mice will be ^xthanized by an overdose of die 
inhale anesdietic hakfthanci^ which is a standard mcdiod of euthanasia. This method is 
15 consistent with the recommendations of die Panel on Euthanasia of the American Veterinary 
Medical Assodatioa 
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PRODUCTION OF HUMAN INSULIN IN TRANSGENIC TOBACCO 
Henry DanieD, Principal Investigator 

5 As the value of molecular &nnmg is realized, investigations are being perfonned to 

develop plant based expcessiosi systems. Aside from being an eoviionmaitaUy fiiendly approach 
chloroplast genetic engineering continually exceeds nuclear gen^ engtneenng in total 
pnxhidion of foreign proteins in plants. A recent publication from our lab (featured on the cover 
of Nature Biotechnology, January 2001) demonstrated foreign gene expression up to 46% of ^ 

10 total soluble protein in chloroplast transgenic plants. However, the full potential of this 
technology is yrt to be realized fioar biopharmaceutical production. Our proposed investigation 
entails both maximizing pcomsulin production in chloroplast transgenic plants and evaluating 
propc^ed imidi&ations for future chlonip^ 



IS Vector construction to synthesize the Qiol»a toxin B (GIB) subimit fused to native 

human proinsulin was first con^Ieted. As described on page 36 of this pcopoesH^ standard 
mcdecular bu>Iogtcal techniques wea:6 used to create the sequence for the fiision protein DNA 
• from indivulnal genes eocodmg CTB and native human pioiiisulin. The DNA encoding this 
fiision protein was tixen snb cloiied into the diloiop]asttEan8£Qrmati0n vector 

20 

AUhougJi the pLD vector contains all the necessary elements for chlon^last e3q>rcssion 
of the CIB^iiroinsuHn fusion protein, an irnportant goal of this mvestigatton is to evahiate the 
elements of translation that maximize foreign protein production. Therefore, as proposed on 
pages 35-38» we have developed additional constrocts, each with a different modification, to 
25 allow for both ^ optunizatiGn of CIB-proinsulin gene e^qpression and evaluation of &ese 
modificsttotts* 



We have cloned the 5' untranslated region of the tobacco psbA gene inchiding the 
promoter (5*UTR)> shown in figure 1 and as proposed on page 36-37. We performed PGR 

30 using the primers CCGTCGACGTAGAGAAGTCCGTATT and 

GCCCATGGTAAAATCTTGG TTTATTTA, which resulted in a 200 base pair product, as 
e9q>ected. We inserted this PGR product into a TA cloning vector. Since restrictian ena^e sites 
were not avmlable to subclone die 5'UTR immediately upstream of &e gene codmg for the GIB- 
proinsulin fosicHi protein, we used the ^^Emg^PCRtecfanique^ described cm page 37, to create 

35 the DNA sequotcc widi Ihc S'UTR immediately iq)strcam of the CTB^roinsnlin gene (Figiire 
2). The products of this PGR mctude both the SUIR (200bp) and tiie gene for GTBixxKnsnlin 
(600bp) as additional products as well as the desired S*UTR GTBixroinsnlin (5GF) at 800 tp. 
5CP was ehited and th«i inserted into the TA cloning vector where DNA sequencmg was 
performed to confirm accuracy of nudeotide sequence before it was subdoned into foe pLD 

40 vector. 



As discussed on p^e 35, chloroplast foreign gene expression conelates well widi %AT 
of Ihe gene codmg sequeooe* The native famnan proinsulin se 

synthesized chloroplast optimized proinsulin is 64% AT. We detennined foe optimal chlon^last 
45 coding sequence for foe proinsulin QPTpris) gene by using a codon composition that is equival^it 
to foe highest translated chloroplast gene, psbA. The prefered codon con^iosition of psbA in 
tobacco is conserved within 20 vascular plant Epedes. We have conqiaredit to foe native human 
jnoinsulin DNA sequence (Ftgnre 3). Since foere are too many changes fbr conventional 
mutagenesis, we employed the Recursive PGR method fbr total gene synfoesis, as described on 
50 page 35; Figure 4 shows the product of fois gene synfoesis corresponding to the 280 bp expected 
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This product PTpds, was tiien used as a template widi CTB and S'UTR to create a fusioa 
of these sequences using the SOEing PGR technique described on page 37. The products of this 
reaction can be seen in figure 5. These include 5*UTR (200 bp), CTB (320 bp), ftninsulin (280 
5 bpX ^d CTB-Proinsulin (600 bp) as side products, and also the desired 5*UTR CTB-FI^pcis 
(5CPTP) at 800 bp. This was then inserted into the TA cloning vector wheie the sequence was 
vedfied before being subcloned into the pLD YBCtor. 

Another parameter of foreign protein {production to be investigated is post-translational. 

10 As discussed on page 37. die DNA for the puitative chaperonin in the Bacillus thuringiensis Cry 
2A2 openm encodes a protein that could potentially fold and crystallize CTB-Ptoinsulin, whidi 
would allow it to accmnnlate in large quantities protected fiom chlaroplast proteases and 
facilitate in subsequent purification. Standard molecular biology techniques were used to insert 
diis DNA firagment immediately upstream of the S^UIR of ^ construct contahnng the 

15 cfaloroplast optimized pnnnsulirL Addittonally, another vector was constructed to contain only 
Shine-Dalgamo sequence (GGAGG) followed by the sequence encoding for the Cholera toxin B 
subunit and synthetic chkzoplast optimized pconisnfin fiision (CTB-FTpris). This construct will 
allow us to detcfmine the value of the prdisalin sequence modification both wifii and widumt 
theS'UTR. 

20 

An of ^ resuhipg vectors^ containing the desired constructs, were used to transform 
botii of the tobacco cultivars^ Petit Havana and LAMD 605 (edible tobacco). Transformation was 
peif(»ined using the particle bombardmCTt method, as d^cribed on page 38-39. Bombarded 
leaves are currently being regenerated into transg^c plants under spectinomycin selectioiL 

25 Several clones have begpm to form shoots. The clones of Petit Ibvana bombaided with the initial 
CIB-hnman promsuHn construct have regenerated large enough for us to extract DNA. 
Bxlracted DNA was used as a template in a PCR reaction to confirm integration of the cassette 
into the chloroplast g^iome by homologous recombination, as described on page 39. We used 
two primers in tixis reaction. 3P and 3M. 3P anneals with die native chloroplast genome, while 

30 3M anneals with die gene for spectinomydn resistance. aadA. The 1600 bp product of this 
reaction is indicative of integration of die construct into the genome (Figure 6). This experiment 
demonstrated that 7 of tiie 11 analyzed clones were the deshed chloroplast transgenic plants. 
Westocn blots are currently underway to confirm expression of various CTB-proinsulin fusion 
proteins in B. coU, Because of the similarity of chloroplast and R coli protein syndietic 

35 machinery, chloroplast vectors are routinely tested in our lab before bombardment Mendxranes 
have been imnnmoblotted with antibodies to both CTB and Proinsuhn. Results demonstrate the 
presence of flie desired fusion proteins* 

We eagerly await tlie regeneration of dte remaining chk>roplast transgenic plants. Our 
40 analysis of these plants will provide essoitial information to develop diis technology for future 
biopbarmaceuticai prodnctioiu Our investigation will also establi^ a method for production and 
delivery of orally admiinsterod protein thenq>ies. Widi adequate production of the CTB- 
proinsutin fusion protran in the edible tobacco plants^ direct consumption of the plant tissue, as 
described on page 43, by NOD mice will prolong or prevent the onset of die autoimmune 
45 diabetes. 

This project has aheacfy overcome irdtial expmmraita! challenge by successfully 
constructing chloroplast vectors with difTerent regulatcny regions utilizing most challenging 
recombinant DNA techniques. Bodx native and codon optimized synthetic proinsulin genes have 
so been inserted ixitochUiroplast vectors. Our laboratory is qmte ef5de^ 

steps to take diis project to a successful con^letioiL Wc look fixrward to NIH fimding of fliis 
proposal to make rapid progress in proposed objectives. 
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a. SPECIFIC AIMS 

Research on human proteins in the past yeais has ievohitioni23Bd the use of these 
therapeutLcally vahiable pratems m a variety of dinical situations. Since the demand for these 
5 protems is e3q[»ecled ID increase considerably in the 

in fhtare fliey will be available hi sigpificantly larg^ amoun1s» preferably m a cost-efifective 
basis. Because most genes can be expressed in many different systems^ it is essential to 
determine which system offers the most advantages for the mano&ctnre of ttie recombinant 
proteuL The ideal e3q)iession syst^ would be one that produces a maidnmm amount of safi^ 

10 biologically active material at a minimum cost The use of modified mammalian cells with 
recombinant DNA techniques has the advantage of resulting in products which are closely 
related to those of natural origm; however, culturing of these cells is intricate and can only be 
canded out on limited scale. Ihe use of microorganisms such as bacteria pennits mano&cture cm 
a larger scale, but iutroduoes die disadvantage of producing products, ^drich differ appreciably 

15 fiom the products of natural origin. For exanqple^ proteins diat are nsnally glycosylated in 
humans are not gj^ycosylated by bacteria. Purtfaennoie» huma^ 

levds in E. ccti frequently acquire an onnatnral confonnation, accon^anied by intracellular 
(seci]Mlation due to lack of proper folding and disulfide bridges. Production of reccHubinant 
proteins in plants has many potential advantages for generating taophannaceuticals relevant to 

20 clinical medicine. These include the following: (I) plant systems are more economical than 
in^jstrial facilities using fomentation systems; (ii) technology is available for harvesting and 
processing plants/ plant products on a large scale; (iii) elimination of the purification requirement 
when the plant tissue containing die recombinant protein is used as a food (edible vaccines); Gv) 
plants cap be directed to taigctproteais mto slatte» intCTcelhlar coaqpartments as cfalaroplastSy or 

25 expressed directly in ddorophst^ (v) the amount of recQmbinant product diat can be produced 
appzoadies industrial-scale levels and (vi) health risks due to oontanmiation widi potential 
human padiogens/lDixins are minimized. 

It has been estimated that one tobacco plant should be able to produce more recombmant 
30 protein ttian a 300-liter fermenter of K colL In addition, a tobacco plant produces a million 
seeds^ fedlitating large-scale production. Tobacco is also an ideal choice because of its relative 
ease of genetic manipulation and an impending need to e^qilore alternate nses for this Irazardous 
crqp. However^ with the exception of enzymes (6.g. phytase)» levels of foreign proteins produced 
in nudear transgenic plants are genenilly low> mosdy less Ihan 1% of the total sohble protein 
35 (1 ). May ^ aL C2a) discuss ttns problem nai^g die following exanqples. Aldum^ plant derhred 
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^)3titis B SUI&C6 aiitigen was as effective as a commerdal recombinant vaccine^ 
tiie levels of e?q)ression in tnmsgemc tobacco were low (0.0066% of total soluble prot^). Ev&t 
thou^ Norwalk virus capsid protein e^qiressed in potatoes caused oral inmmnization when 
consumed as food (edible vaccine), expression levels were low (03% of total soluble protein), in 
5 paiticolar, expressum of Imnian proteins in nndear transgenic plants has been disappointuigly 
low: e.g. human hitErferon-O 0.000017% of fiesh wei^i^ human sennn albumin 0.02% and 
erytiiropoietin 0.0026% of total soluble protehi (see tablel in refl). A syntiietic gene coding for 
the human epidermal growth fiictor was expressed only vp to G.001% of total soluble protein in 
transgBoio tobacco (2a). The cost of producing iec(»nbinant protdns in a]&]& leaves was 

10 esthnated to be 12-fold lower titan in potato tubers and comparable with seeds (1). However; 
tobacco leaves are irroch larger and have much hig^ biomass than al&lfe. The cost of 
production of recombinant proteins will be 50-fold lower than fbat oiKcoU fomentation (with 
20% e^uession levels, 1). A decrease in insulin expression fiom 20% to 5% of biomass doubled 
^ cost of production (2b). Bqnession level less dian 1% of total soluble protein in plants has 

15 been finmd to be not coinmennally feasible (1). Therefore^ it is impor t an t to increase levels of 
expression of lecomhinant proteins in plants in order to esq>loit plant production of 
phazmacologtcaUy iir^rtantprotdns. 

An ahemate q>proach is to esqaress fbrdgn protdns in cMoroplasts of higher plants. We 

20 have recently integrated foreign genes (up to 10^000 copies per cell) into the tobacco chloroplast 
genome resuUmg in accunmlation of recombinant proteins up to 47% of the total ceOnlar protein 
(3). Chloroplast tiansibrmation utilizes two flanking sequences diat, throng^ homologous 
recombination, insert foreign DNA into the spac^ region between the fonctioDal goies of the 
chlarq>last genome, dms targ^mg the foreign genes to a pzedse location. This elimmates die 

25 ^^position effecf* and gene dlendng frequently observed m nuclear transgenic plants. 
Chloroplast g^etic engmeexui^ is an envinnmientally fiiendly f^proadi, minimizmg concerns 
of out-cross of intcoduced traits via pollen to weeds or odier crops. Also, the ooncems of insects 
developing resistance to biopesdddes are TrnnimizBd by liyper-eTqiression of smgle insecticidal 
proteins (hig^ dosage) or esqpiressianof difTeieDtQpes of insecticides in a smgle transformation 

30 event (gene pyramiding). Concems of insectiddal proteins on non-target insects are minimized 
by lack of expression in transgenic pollen. Most importantly, a significant advantage in the 
production of pharmaceutical proteins in chloroplasts is their ability to process eukaryotic 
proteins, including folding and formation of disulfide bridges (4). Chaperonin proteins are 
present in chloroplasts (5,6) diat function in folding and assembly of prokaryotic/eukaryotic 

35 proteins. Also, proteins are activated by disulfide bond oxido/reduction cycles using ^ 
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oredoxfn system (7) or chloroplast porotem disulfide isomerase (8). Accumuladon 
of fully assembled, disulfide bonded fonn of hninan somatotropin via chloroplast transfonnation 
(9) and oligomodc form of CTB (10) aad assembly of heavy and light dmins of humanized 
Gruys 13 antibody in transgenic chloroplasts (11) provide strong evidence for successful 
5 processing of pharmaceutical proteins inside chloroplasts. Such folding and assembly should 
dinnnate the need for hig^ expensive m vitro processing of phannaceotical proteins. For 
exanqsle^ 60% of the total operating cost in the production of hunxan insulin is associated with in 
vitro processmg (formation of disofide bridges and cleavage of methionine)C2b). 

10 Taken together, low levels of expiessiQa of hnman proteiiis in nuclear hcansgenic plants, 

and difBculty in folding, assembly^izocesstng of human protdns in Kcoti should make 
diloiX)plasts an ideal compartment for expression of these protem^ production of human proteins 
in transgenic chloroplasts should also dramatically lower the production cost Large-scale 
ptoduction of these protehis in plants should be a powerful approach to provide treatment to 

1 5 patients at an affordable cost and provide tobacco fermeis alternate uses for this hazardous crop, 
llierefore^ we propose here esqpressian of tfierqieutic protons in transonic tobacco chloroplasts 
to increase levels of exprossion and acconq>liA £ri vrM> process!^ 

Objectives 

20 a) Devetop cecomhmant DNA vectors fin: enhanced expression of Human Senun Albumin, 
Insulin like growth iactor I and Interferon-n 2 and 5, via chloroplast genomes of tobacco 

b) Optbjiize processing and purification of phainiaceutical proteins ua^ 
KcoU 

c) Obtain transgenic t[^}acco plants 

25 d) Characterize transgenip expression of protdns or &aion proteins using molecular and 
biochemical methods in chloropkute 

e) E^loy existing or modified niethods of purification fiomtiaiisgem 

f) Analyze MendeUan or matramal inheritance of tran^enic plants 

g) Large scale purification of tfaexqieutic proteins fiom transgenic tobacco a^ 
30 cmient purification rnethods in jSlcoiSf or yeast 

h) Conq>areimtural refolding in chloroplasts with existing in vi^ 

i) Companson/charactenzation (yield and purity) of ther^utic proteins produced in yeast or 
Kcoli vfitix transgenic tobacco chlorc^lasts 

j) In vitro and in vivo {pre-<^hdcaltja^ 
35 ' 
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b. BACKGROUND AND SIGNIFICANCE 

HUMAN SERUM ALBUMIN 

HSA is a monomeric globular pcotem and consists of a single, generally nong!ycos>iated, 
5 polypeptide chain of 5&5 amino adds (66.5 KDa and 17 disulfide bonds) with no 
postianslationalmodificalions. It is canapoBed of three structorally similar globular domains and 
the disulfides are positioned in repeated secies of nine loc^tink-lopp stmctures centered around 
eig^ sequential Cys-Ciys pairs, HSA is initially synfliesized as pre-pto-alhumin by the liver and 
released fiom the endoplasniatic reticulum after removal of the amtnoterminal prepc|itide of 1 8 

10 amino adds. Tlio pro-albunmi is fhrthn pxocessed in the Golgi oaxaplm vAme the other 6 
aminoteaninal residues of the propqrtide are cleaved by a setme proteinase (12). This results in 
thesecretionof the mature polypq)tide of 585 amino acids. EISA is ^coded by two codominant 
autosomic allelic gienes. HSA belcmgs to tiie mnhigene &nily of proteins that include a^dia'^ 
fetoprotein and hun^ group-specific component (Qc) or vitamin D-binding family. HSA 

15 facilitates transfer of many ligands across organ circulatory inter&ces such as in the liver» 
intcstiiie^ kidney and brain. In addition to blood plasma, senim alHiitnin is also found in tissues. 
HSA accounts for about 60% of the total protem in blood serum. In Hbe serum of human adults, 
the eoricentration of alburnin is 40 rng^lmL 

20 Medical appUcatioiis: The primary fbnction of HSA is the maintenance of colloid osmotic 
pressure (COP) within the blood vessels. Its abundance makes it an important determinant of tiie 
pharmacokinetic behavior of many drags. Reduced syndiesis of HSA can be due to advanced 
liver disease, impaired intestinal absorption of nutrients or poor nutritional intake. Increased 
albumin losses can be due to kidney diseases (increased ^omernlar permeability to 

25 macromolecules in the nephrotic ^dromeX intestinal diseases (protein-tesing enteropadnes) or 
exudative skin disorders (bums). Catebolic states such as chronic infections, s^»sis, surgery, 
fatefitnial yeseetimi, tmnma. or eartervavti fawns can afan canset hyprialhnmin«Twa HS^ jj? jn 
therapy of blood vcdnme discxdeis;, far example posdiaemorrliagic acute hypovolaemia or 
extensivB bums, treatment of dehydration states, and also far dnhotic and hqxitic illnesses. It Is 

30 also used as an additive in pexfhston liquid fiirextr^^ 

fiur replacing blood volume, but also has a variety of non-ther^>eutic uses» including its role as a 
stabilizer in fonmdations for other ther^>eutic proteins. HSA is a stabilizer for biological 
mflterials in nature and is used far pr^aring biological standards and reference materials. 
Furthermore, HSA is fiiequeotfyused as an experimental antig^ a cellrculture constituent and a 

35 standard in cIinicaI-K±emistry tests. 
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fixpressioii systems: Hie expression and pinification of lecomhmaiit HSA &om various 
microoiganisms has been reported previously (13-17). Saccharomyces cerevtsiae has been used 
to produce HSA both intracellulaiy, requiring denaturation and refolding prior to analysis (18^ 
5 and by secretion (19). Sealed HSA was equival^t structurally, but the recombinant product 
had lower levels of eqnession (recovery) and structural h^erogeneity compared to the blood 
derived protein (20). HSA was also expiessed in Kb^verowyces tactis, a yeast with good 
secretary properties adneving 1 g/Iiter in fed batch cultures (ZlXOhtanietal (22) devefepe^ 
HSA expzes^on system using Pidaa pastoris and established a purification mettiod obtaining 

10 leooEmbiiiant protein wittisinmlarlevBb of purity 

.si^l£^x7» HSA could be secr^ed using bacterial signal pepti coH 
was successfid but required addxtional m processing with trypsin to y^^ 
(14). Sijmons et aL C23) e3qn:essed HSA in transg^c potato and tobacco plants. Fusion of HSA 
to the plant PR-S prcscqucocc resulted in cleavage of &e presequence at its natural site and 

15 secretion of correcdy processed HSA» that was indistingoishable fiom the authentic human 
protein. ThB ^resson was 0.014% of the total soluble protem. However^ none of diese 
meidiods have been exploited connnercially . 

Challenges in commercial production: Albumm is currently obtained by protein fiactionation 

20 from plasma and is tiie wodd's most used intravenous protem, esthnated at around 500 metric 
tons per year. Albumin is administmd by intravenous injection of sohitions containing 20% of 
albumin. The av»:age dosage of albumin for each patient varies b^een 2O40 grams/day. Ilie 
consumption of albumin is around 700 kilograms per milUon habitants per year: In addition to 
hig^ cost» HSA has ^ risk of transmitting diseases as wi^ otiior blood-derivative fsoducts. 

25 Thd price of albumin is about $3.7/g. Thus, the marlcet of Ibis protein approximately amounts to 
$ 2,600»Ci00 per nuDion people per year (0.7 biUioH dollars per year in'USA)L Because of the 
hig^ cost of albumin, synttietic macromolecnks (like dextians) are used to increase plasma 
coUoidosmotic p]:essure. 

Connnennal HSA is mainly prepared finm human plasma. This source, hardly meets ^ 

30 lequirements of ttie world market The availability of human plaiawfl is limited and caiefiil heat 
treatment of the product prq)ared must be performed to avoid potential contamination of flie 
product by hepatitis, HIV and other viruses. The costs of HSA extraction from blood axe very 
high. In order to meet the demands of the large albumin market with a safe product at a low cost, 
innovative production systems are needed. Plant biotechnology offers promise of obtaining safe 

35 and cheap proteins to be used to treat human diseases. 
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INTERFERON ALPHA 

Interferons (IFNs) constitute a heterogeneous himfy of cytokmes wi& antiviial, 
antigrowtfa» and inmumomodulatory properties (24-26). Type I IFNs are acid-stable and 
5 constitate the first line of defence against viruses, both by displaying direct antiviial effects and 
by interacting with die cytokine cascade and die imnnme ^stem. Their function is to induce 
zpgulation of growtii and dif^entiatkm of T cells. The human IFN-a &mily consists of at least 
22 intronless genes» 9 of which are pseudogenes and 1 3 expressed genes (subtypes) (27). Buman 
IFN-a genes OKsode proteins of 188 (ht 189 amino acids. The fust 23 anxino adds constitute a 

10 fflgpal peptide, and fte other 165 or 166 amino adds form ^ mature protein. IFNki subtypes 
s3iow 78-94 %homolpgy at die nucleotide level Flesence of two disulfide bonds between Cys- 
l:Cys-99 and Cys^29:C^139 is conserved among all IFN-a spedes (28). Human IFNkx genes 
are expressed constitutively in org^ of narmal individuals (29^0). IndividuallFN-a genes aie 
differently expressed dq>eQduig on the stimulus and they show restricted cell type e]q)ression 

15 pi). Although an IFN-a subtypes bind to a common receptor (32), several reports suggest ihat 
they show quantitatively distinct patterns of antiviral, growtii inhibitory and immunomodulatory 
activities (33). IFN-a8 and IFN-aS seem to have the greatest antiviral activity in liver tumour 
cells HuH7 (33). IFN-ct5 has, at least, die same antiviral activity as IFN-a2 in in vitro 
experimffiits (unpublished data in Dr. Prieto' s lab). It has been shown recentiy that IFN-a5 is die 

20 sob IFN-a subtype e^iressed in normal liver tissue (34). IFN-a5 expression in patients with 
chronic l^»titils C is rechKed in die fiver (34) and induced in mononuclear ceUs (35). 

Medical appUcatfons: biteffenms are niainty Imown &r their aiiti^^ 
spectnmi of vinises but also for dieir i^tective lole aganist some rxa^ 

25 potent iimmmomodnlators, possess direct antij^liferative activities and are cytotoxic or 
cytostatic for a number of different tnmour cell types. IFN^ is mainly employed as a standard 
ther^y for hairy cell IraiTfaemia^ metastasizing cardnoma and AIDS-assodated angjogeoic 
tumours of mixed cellularity known as bqxisi sarcomas. It is also active against a number of 
other tumours and viral infecticHis. For exan^ile, it is the current ^yproved therapy for chronic 

30 viral hq^atitis B (CHB) and C (CEC). The IFN-a subtype used for chronic vu^ hepatitis is IFN- 
al. About 40% of patients with CHB and about 25% of patients with OHC respond to this 
therapy with sustained viral deaiance. The usual doses of IFN-a are 5-10 MU (subcutaneous 
injection) three days per wedc &r4-6 nunrdis for CHB and 3 MU three days per week for 12 
months for CHC. Three MU of IFNa2 represent approximately 15 Gg of recombinant protdiu 
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ate m patients wifli diranic hqmtitb C 
ribavirin. This combinatiGii ther^y, which considerably increases the cost of die therapy and 
causes some additional sMe effects, results in sustained biochemical and virolog^cal remission in 
about 40-50% of cases. Recent data suggest that pegilated interferon in weekly doses of 180 Dg 
5 can also increase the sustained refuse rate to about 40%. lFN-a5 is the only IFN-a subtype 
expressed in liver; this cTqiression is reduced in patients with CHC and IFN-a5 seems to have 
one of the hi^iest antiviral activity in liv^ tumour ceils (see above). An international patent to 
use IFN-a5 has been filed by Frieto's group to fecilitat e commercial development (36)w 

10 BxjMnessioii systems: Human mteifeions are cutreiiify prepared in micarobial systems via 
recombinant DMA technology in amounts wbiek cannot be isolated fiom natuial sources 
0enkocytBS» fibroblasts^ lynqihocytes), IMfferent recombinant interfiaton-D genes have been 
cloned and egqpiessed in K cott (?7ajct) or yeast ^8) by several groups. Generally^ the 
synthesized protein is not conecdy folded due to the lack of disulfide bridges and therefore, it 

15 remains insoluble in inclusion bodies that need to be ^lubilized and refolded to obtain tiie active 
interferon (39»40). One of the most efficient methods of inter&nnHD expression has been 
published recently by Babu et aL (41). In this method, E. coli ceUs transformed with interferon 
vectors (regulated by ten^eiature inducible promoters) were grown in high cell density cultures; 
this resulted in the production of 4 g interferon-gyiiter of culture. Expression resulted 

20 exchisivi^ in die fiaxm of insoluble inclusion bodies which were sotubilized under denaturing 
conditions, refolded and purified to near homog^eity. The yidd of purified interferoo-D was 
approximately 3QQiDg/i of culture. Bqxressioa in plants via the nuclear gmKune bas not been 
veiy successfiiL Smimov et aL (42) obtained transformed tobacco plants with Agrobacterium 
^Um^/^denjusmgdieiulerferon-D gene und^35SCaKfV promoter but the e^qnession level was 

25 very low. Eldelbaum et aL (43) showed tobacco imclear ttansfonnation with Interferon-D and 
tiie esqnession level detected was 0.000017% of fiesh weigjit 

Challenges in commercial production: The nmnber of subjects infected witii liq[»atitis C virus 
(HCV) is estimated to be 120 milliGii (5 imDion in Europe and4 millian in USA). Seventy per 

30 cent ofthe infected people have abnomial fiver fimction and ab^ 

viral hepatitis or cirrfaosls. It mi^ be estimated however that diere are about 10,000-15,000 
cases of chronic infection with hqpatitis B vims (HBV) in Europe, a slightly lower number of 
cases in USA. hi Asia the prevalence of chronic HCV and HB V infection is very hig^ (about 110 
miUion of people are infected HCV and about 150 milHons are infected by HBV). In Afiica 

35 HCV infection is very prevalent Since mnemittiim chnmic viral hepatitis leads to liver cirriiosis 
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^ to Ever cancer» the higji prevalence of HBV and HCV infection in Asia and 
Aj^caaccoimtsforttieirYeryhig^inddenceofhepatoceUular catcm Based on these data, 
the need for IFNkx is large. IFN-<x2 is cuirraitly produced in microorganisms by a number of 
companies and the price of 3 MU (1 5 □ g) of recombinant protein in the western maricet is about 
5 $25. Thns, the cost of one year IFN-ct2 &eiq>y is about $ 4,000 per patient Hiis price makes 
this prodoct unavailable for most of tixe patients in the world suffering firom chronic viral 
hepatitis. Qeaxiy methods to produce less eg^ensive recombinant proteins via plant 
tnotechnob^ innovations would be cmcial to make antiviral therqyy widely available. Besides^ 
ifIPN-a5 is more effknenr than IPN-CC2, tower doses may be required^ 

10 

mSDUN-UKE GROWm FACTOR-1 QGBA) 

The IhsulinrlikB Growtti Factor protein, IGF-J^ is an anabolic homionB with a complex 
maturation process. A single IGF-I gene is transcribed into several mRNAs by ahecnatiye 
slicing and use of different tran3crq>tion initiation sites (44-46). Depending on die choice of 

15 splicing, two immature proteins are produced: IGF-IA, eTqaressed in several tissues and IGF-IB, 
mostly expressed in liver (45). Both pre^proteins produce the same mature protein. A and B 
immature fonns have different lengdis and co^^>osition, as their tamuni are modified post- 
tcanslationally by glycosj^ation. However, these ends are processed in ib& last step of 
mataiation. Mature IGP4 piotein is secreted^ not glycosylated and has thiee disulfide bonds, 70 

20 amino acids and a molecular wd^ of 7.6 kD (47-49). Physiologically, ICrF-I expression is 
induced by growdi hoocmone (CIH). Actually, the knock out of IGF-I in mice has shown that 
several fixncti<Mis attributed originally to CHI aie in fiict mediated by IGF-L GH production by 
adenohypofisis is r^uessed by feed-bade inhibition of IGP-L GH induces IGF-I syndiesis in 
different tissues, but mostly in liver, where 90% of IGF-I is produced (48). The IGF-I receptor is 

25 esqpressed in different tissues. It is fonned by two polypeptides: alpha that interacts with IGF-I 
and beta involved in signal transduction and also present in the insulin receptor (50,51). Huis, 
IGF-I and insulin activation are similar. 

Medical plications: IGF-I is a potrat mnltiinnctional anaboHc hcnmone produced in die livar 
30 upon stimulation by growth hormone (GHX Li liver drriiosb the reducdon of lecq^tom for Cffl 
in hqpatocytes and die dinmrishwl synttiesis of the liver parenchyma cause a progressive &E of 
senunlOF-I levels. Piatients wiOi livardrdiDsislmve anmnber of systeniic denrangemrats such 
as muscle atrophy, osteopenia, hypogonadism, protein-calorie malnutrition which could be 
related to reduced levels of diculating IGF-I. Recent studies from Pri^^s laboratoiy have 
35 demonstrated that treatments with low doses of IGF-*! induce significant improvements in 
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tus (52), intestinal absoiption (53-55), osteopenia (56), bypogonadism (57) and 
liver function (58) in mts witib e?q)ermie]ital liver ciniosis. These data support that IGF-I 
deficiency plays a pathogenic lole in several systemic coiiq>lications occuning in liver cirrhosis. 
The liver can be considered as an endocrine gland synthesising a honnone such as IGF4 with 
5 important physiological functions. Thus liver dnhosis should be vi^ved as a disease 
accompanied by a haaaooe deficiency syndrame for which replacement thenqjy with IGF-I is 
wananled. Climcal studies are in pocogress to ascertain the role of IGF-I in ihQ management of 
cinhotic padeots. IGF-I is also b^g cuirenffy used fox Lanm dwacEism treatment These 
patients lack fiver CR receptor so I(3P4 is net esqxressed (59). Also acting as a 

10 hypogjycemiant, is given together with insolin hi diabetes mellitas (60,61). Anabohc effects of 
IGF'I are used in osteoporosis treatment (62,63) hypercatabolism and starvation due to burning 
and HIV infection (64,65). Uiq[yublished Indies indicate tiiat IGF-I could also be used in patt^its 
wifli articular degjcneiative disease (osteoarttiritis). 

15 Expression Systems: The potency of IGF-I has encouraged a great number of scientists to tcy 
IGF-I expression in various mi cw oigy nisms due to the small amoimt present in human pksma. 
Pzodnction of IGF-I in yeast was shown to have several disadvantagiBS Uke low fermentation 
yieids and risks of obtaining undestrable ^^yoosylation in ttiese molecules (66). Expicssion in 
bacteda has been die most successM approach, either as a secreted form fused to protem leader 

20 sequences (67) or fused to a s61ubilized a:ffinity fusion proteui (68). In addition, IGF-I has been 
produced as insoluble inclusion bodies fused to protective polypeptides (69). Son-Ok Elim and 
Young Lee (70a) expressed IGF-I as a tnmcated beta-galactosidase fusion protein. The final 
purification yielded approxim ately 5 mg of IGF-I having native conformation per liter of 
bacterial culture. IGF-I has also been expressed in anrnialQ^ Zinovieva et aL (70b) reported an 

25 e3q[xre8sionof0.543mg(ml in rabbit milk. 

Chafien^ in commercial prodDdmi: IGF-I dcculates in plasma in afiurfyhig^ concentratiQn 
varying between 120400 ng/knL hi ctnhotic patients the values of 1(^-1 fall to 20 ngfval and 
fire^iently to undetectabk levels. Replacement Hbeaipy with IGF-I in liver dnhosis requires 

30 administiation of 1 J-2 n^ per day fiir each patient Urns, evoy cuihotic patient will consume 
about 600 mg per year. IGF-I is cnctently produced in bacteria (71). The hig^ amount of 
recombinant protein needed for IGF-I replacement therapy in patients with liver cirrhosis will 
make diis treatment exceedingly eoqiensive if new me&ods for che^ production of recombinant 
proteins are not devel(^>ed. Besides, as described above, IGF-I is used in treatment of dwarfism, 

35 diabetes, osteoporosis, starvation and hypen;atabolisnL IGF-I use in osteoarthritis is currently 
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ated Ag^ plant tnotedmobgy could provide a solalion to mate ecraomically 
jfeaable the ^Ucaticm of IGF-I therapy to all these patients. 

CHLOROPLAST GENETIC ENGINEERING 
5 When we developed die ccxicept of chloroplast genetic engineering (72,73), it was 

possible to intiodace isolated intact cfaloroplasts into protoplasts and regeaerate transgenic plants 
(74). Therefore, early investigations on chloroplast transformatian focused on the development 
of in organdlo systems using intact chloroplasts cq»ble of e£5ciexit and prolonged transocq^tion 
and translation Q5-Tf) and esqxression of fisreiga genes m isolated chloroplasts (78). Bowever» 

1 0 after die discovery of Ihe g^ gun as a tnmsfonnation device (79X it was possrble to transfonn 
plant chloroplasts without the use of isolated plastids and protoplasts. Chloroplast gen^ 
engineering was accomplished in several phases. Transieat er^iession of fbrciga genes in 
plastids of dicots (8031) was followed by such studies in monocots (S2). Ihiique to the 
chlorc^last genetic engineering is the cfevelopmciit of a foreign gene e^qiression system "CTng 

15 autonomottsly rephcating chloroplast e^ncssion vectors (80). Stable integration of a sdectable 
marker gsue into the tobacco chloraplast gearane (83) was also accamplished using Ihe gene 
gun. However, useful genes confarring valuable traits via chlaroplast genetic engineering have 
been demonstrated oofy recently. For exanqde; plants resistant to B.t smisitive insects were 
obtained by int^|«ting the aylAc gene into the tobacco chloroplast genome (84). Plants 

20 resistant to B.t resistant insects (up to 40JOOO fol^ were obtained by hyper-e^qpEression of die 
aylA gene within the tobacco chloroplast genome (85). Plants have also been genetically 
engineered via die diloroplast genome to confer herbicide resistance and the introduced foreign 
g^es were maternally inherited, overcoming the problem of out-cross wilh weeds (86). 
Chloroplast genetic engineering technology is currently beirig applied to dher useful crops 

25 (73,87)1 

c PRELIMINARY STUDIES . 

A remarkable feature of chlorq)last genetic erigincering is the observatian of 
30 exccptianally large accmiuilation of feonsigp protems in transgenic plants, as much as 46% of 
CRY protein in total soluble protein, even in bleached old leaves (3). Stable expression of a 
pharmaceutical protein in chlon^lasts was first reported for GVGVP, a protein based polymer 
with varied medical applications (such as the prevention of post-surgical adhesions aiKi scars, 
wound coverings» artificial pericardia, tissue reconstruction and programmed drug deMvery (88)). 
35 Subsequently, expression of the human somatotropin via die tobacco chloroplast ^ome (9) to 
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i of total soluble protein) was observed. The following investig^om tiiat are in 
progress in the Daniell laboratory illustrate the power of this technology to express small 
peptides, &atm operons, vaccines ftat lequiie oligomeric protdns with stable disotfide bridges 
and monoclonals that reqoiie assembly of heavy/lig|ht chains via chaperonlns. 
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20 Engineering novd pathways via the diloroplast genome: la plant and animal cells^ nnclear 
mRNAs are translated monocistronically. Tins poses a serious problem whKi engineering 
multiple genes in plants (91). Hietefoie, in order to express the polyhydroxybutyrate polymer or 
Guy's 13 antibody, single g^ies were first introduced into hxdmdoai transgenic plants, then 
tiiese plants were back-crossed to reconstitute ttie entire pathway or the conqylete protein (92,93)» 

25 Similarly) in a seven year long effort Ye etaL (81) recently 

short biosynthetic padiway that resulted in ^-carotene expression in rice. In contrast, most 
diloroplast g^es of faigjher plants are cotransoibed (91), Bqxression of polycistmiis via the 
chloroplast genome provides a uniqpie opportunity to ejqpiess entire padiw^ in a single 
transformation event We have recently used tef'adiliis/Aurte^ 

30 model system to demonstrate operon compression and crystal formation via die cfaloroplast 
g)5nome(3). Ov2Aa2 is the distal gene of a tfire&.gene qienwi. The /Mgr irmnftrtifttftly ^st^im 
crylAal codes for a putative chaperonin that Militates tiie folding of crK2Aa2 (and other 
proteins) to form jHoteolyticaUy stable cuboidal crystals (94). 
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)re. Hie cry2Aa2 bacterial operan was expressed in tol>acco chloroplasts to test the 
resultant transgenic plants for incxeased expresskm and impioved persistence of die accumulated 
insectiddal protem(s). Stable foreign gezie integration was confinned by PGR and Southern bto^ 
analysis in To and Ti transgenic plants. CrylAol operon derived protein accumulated at 453% 
5 of die total soluble protein in mature leaves and remained stable even in old bleached leaves 
(46.1%XPigaie 1). This is die highest level of foreign gene expression ever reported in 
tiansgienic plants. Bxceedingily difficult to control insects (iO^y old cotton boUworm, 
beetanny worm) were killed 100% after consuming transgenic leaves. Electron micrographs 
showed the presence of the insectiddal protein folded into cuboidal crystals similar in shqm to 

1 0 Cry2Aa2 crystals observed in Baalhis ^turinffeiK^ (Figgie 2). In contrast to cncrently maiketad 
transgenic plants witfi sohiUe CRY protems^ folded protoxin crystals will be processed only by 
target insects that have alkaline got pH; this approach should im|»Dve safety of Bt transgenic 
plants. Absence of insecdcidal proteins in transgenic pollen eliminates toxici^ to non-taiget 
insects via polkn. In addition to these environmentally fiiendly approaches* tiiis obser^on 

1 5 should serve as a model system for large-scale production of foreign proteins widiin chlcxoplasts 
in a folded configuration enhancing dieir stability and &ralrtating single step purification. This is 
the first demonstration of expression of a bacterial apemi in trazsgBiiic plants and qpens the 
door to engineer novel padiways in plaiits in a sing^ transforinatto 

20 Expressing small peptides via the chloroplast genome: It is common knowledge diat the 
medical commnnity has been fighting a vigorous batde against drug resistant pa&ogenic bacteria 
for years. Catianic antibacterial peptides fiom mammals aiiq)hibians and insects have gained 
more attention over the last decade (95). Key features of these catianic pq>tides are anet positive 
charge^ an afBnity for negatively-charE^ prokaryotic membrane phosphoiqpids over neutral- 

25 charged eukaiyotic membranes and tibe abihty to form aggregates that disrupt tiie bacterial 
membrane (96). 

There are three m^'or peptides with a-4ielical structures^ cecropin finom Hyalopkora 
cearopia (giant silk moth), tnag^hrins fasmXenopus Jaevis (Afdcan fiog) and ^^fffengjn? fiom 
30 mammalian neutrophilSL Magamin and its analognes have been studied as a broad-spectrum . 
topical agents a systemic antibiotic; a wcund-healing stimulant; and an anticancer a^t (S^. We 
have recently observed diat a synthetic lytic peptide (MSI'99» 22 amino acids) can be 
successfolly expressed in tobacco chloroplast (S>8). The peptide retained its lytic activity against 
the pfaytopathogeoic bacteria Pseudomonaa syringae and multidrug resistant human pathogen. 
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aerugptosa. The anti-microbial peptide (AMP) used in Has Study vna an 
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commonly found in tiie cuter-memlnane of bacteria. Upon contact with these membranes, 
20 indtvidoal pqilides afficegate to fimn pores in the membrane, resulting in bactaial 'lysis. 
Because of the concentration dependent action of the AMP, it was caressed via tb& chioroplast 
genome to acconq>lish high dose deliveiy a! the point of infection. PGR products and Southern 
blots confirmed diloroplast int^ration of die foreign genes and homoplasmy. Growth and 
develqpment of the transgenic plants was unafifected by hyper-expre^ion of the AMP within 
25 chlotoplasls. In vitro assays with To and T| plants confirmed that the AMP was exptes&eA at 
hi^ leveb ^1^ to 43% of die total soluble protein) and letatned biological activity against 
Pseiidomtmas syringae, a major plant palhogien. In situ assays resulted in intense areas of 
necrosis around the point of infection in control leaves, while transformed leaves showed no 
signs of necrosis (200-800 }xg of AMP at die site of in&cdmX^gqre 3). Ti in vitro assays 
30 against Pseudcmonas aerugfiwsa (a nmlti-drog resistant hnman pathogen) dt^layed a 96% 
inhibition of growth (Figure 4). These results gnre a new optim in the batde against 
phytqpathogenic and drug-resistant human pathogenic bacteria. Small peptides (like insulin) are 
degraded in most organisms. However, stability of this AMP in cbloroplasts opens \q> dus 
compertment for exjsession of hoxmones and other small peptides. 

35 
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' cholera todn ^ subanit ol^niiers as a Yaccine in dttoroplasts: Vibrio 
ckoteraey wMch causes acute water/ dianitea by ootonizmg^ 

enterotoxin, cholera toxin (CT). Cholera toxin is a hexameric AB5 protein oonsistmg of one 
toxic 27kDa A subunit having ADP ribosyl transferase activity and a nontoxic pentamer of 1 1.6 
5 kDa B subunits (CTB) that binds to the A subunit and ^cilitates its entry into &e intestinal 
epithelial cells. CTB -wbsn administered orally (99) is a potent mucosal rmmunogoi which can 
neotialize the toxicity of the CT holotoxin by preveoting it fo>m binding to ^ intestinal cells 
(100). This is believed to be a result of it binding to aikaiyotic cell soi&ces via fts& G]^i 

gangliosides^ Tecq[ytors present on the intestinal ^tfaelial surface^ flujs elicitiiig a mucosal 
10 rnimxing icqKmse to patbog^is (101) and enhancing the imrmme rcqxinse when chonicaUy 
cotq[>ledto otiier antigens (102-105). 

Quoleia toxin (CIB) has previonsly been eiqiiessed in nuclear tran^enic plants at levels 
of 0.01 (leaves) to 03% (tubers) of the total soluble {»x>tein. To mcrease expression levels, we 

15 engmeered the chloroplast genome to express the CTB gene (10). We observed es^ressLon of 
oligomeric CTB at levels of 4-5% of total soluble plant protein (Figure 5A). PCR and Southern 
Blot analyses confirmed stable integration of the CTB gene into the chloroplast genome. 
Western blot analysis showed ttiat transgenic chloroplast expressed CTB was antigenicaliy 
ulcotical to commennally available purified CIB antigen (Figure 6). Also, GMl-gBi^gUoside 

20 iHnding assays confirm that chloroplast ^thesized CTB bmds to the intestmal mranbrane 
receptor of diolera toxin (Figure SB). Transgenic tobacco plants were morphologically 
indistingaishable fiom tmtransfbmiBd (toils and the introduced gene was found to be stably 
inherited in the subsequent generatioa as confirmed by PCR and Soudiem Blot analyses. Ibe 
increased production of an efficient transmuoosal earner molecule and deHvaiy systEOOy liko 

25 CIB» in chloroplasts of plants makes plant based gtsX vaccines and fusion protdns wi& CTB 
needmg oral admiriistcatLoii, a iimch more feasible ^^noat^ This also establishes unequivocally 
that diloroplasts are ciq^able of forming disulfide hddlges to assemble foreign proteins. 

Expressicm and assembly of monodonals in transgenic chloroplasts: Dental caries (cavities) 
30 is probably the most prevalent disease of hnmanlrind. Colonization of teeth by £ mutans is the 
sirig^ niost tiiqiQrtarit risk &ctor in die <tevd(^^ im<liem is a non-motile^ 

gram positive coccus. It cokmizes tootti sur&ces and syntiiesizes ghicans (insoluble 
polysacdbaride) and finctans fiom smaose using the enzymes ghicos^tiansferase and 
&uctosyltrans&rase respectively (106a). The ghicans play an inqxntant role by allowing the 
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dhere to tike smooth toofk smfiices. After its adherence, die bacterium fenneats 
sucrose and produces lactic add. Lactic add dissolves the minerais of the tootii» producing a 
cavity. 

A topical mcmoclcsial antibody therapy to prevent adherence of 5. mutans to teedi has 
5 recently been developed. The incidence of cariogenic bacteria (in humans and animals) and 
dental caries (in animals) was dramatically reduced for periods of iqp to two years after the 
cessation ofthe antibody tfaer^. No adverse events were detected eatfaer in the etxpo^ 
or in human vohmteers (106b). The annual requirement for this antibody in tiie US alone may 
eventually exceed 1 metric ton. Therefore^ this antibody was eoqiressed via the diloroplast 

10 gmme to adiieve faig|ier levels of e^qpression and proper folding (11). The integration of 
antibody genes into the chloroplast genome was confirmed by PGR and Soulhem blot analysis. 
The expression of both heavy and li^ chains was confirmed by western blot anal3rsis under 
reducing conditions (Hgaie 7A3)- Ihe e^qnession of fiilly assembled antibody was confirmed 
by western blot analysis under non-reducing oonditLons (Figure 7C). This is &c first report of 

15 succcssfiil assembly of a nmlti-subumt human protein in transgenic ddoroplasts. Production of 
monodcmal antibodies at agricultural level should reduce fbm cost and crcatb new a{^cations 
of monoclonal antibodiesi 

HUMAN SERind ALBIJMIN 

20 Nndear transformalioii: Recen^, Dr.'s Mmgo-Castel gnvup in Spain (a Co-PI in this proposal) 
cloned the human HSA c^NA firm human liver cells and fiised the patatxn promoter (whose 
expression is tuber qpedfic (107)) along with the leader sequence of PIN II (proteinase n 
inhibitor potato transit peptide that directs HSA to tiie i^oplast ( 1 OS)) . Leaf discs of Desiree and 
Kennebec potato plants were transformed using Agrobactermm tumefadens. A total of 98 

25 transgenic Deairee ckmes and 30 Kennebec clones were tested by PGR and western blots. 
Westmi blots ^wed ttiat the lecooibiiumt albumin (rHSA) had been properly deaved by the 
proteinase n inhibitor transit pqitide ^gnre 8). Ejq[xression levels of boUi cdtivazs were very 
difierent among all transonic dones as expected (Figure 9X probably because of positkm effects 
arxl gene gilmciTig (89^). The pqmlaticKi distributicm was similar ui both cuMvais: majority of 

30 transgenic dones showed esqiression levds between 0.04 and 0.06% of iHSA in the total soluble 
protein. The maxiinum cccombmant HSA amount expressed was 0.2%. Between one and five T- 
DNA insertions per tetr^loid genome were observed in these clones. Plants widx higher protein 
egression were always clones witti several copies of ^e HSA gene. Levels of mRNA were 
analyzsed by Northern l^ots. There was a correlation between traiiscrq>t levels and recombinant 

35 albumin accumulation in transgmiic tubers. The N-termizzal sequence showed proper cleavage of 
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ide and amino tenninal sequence between reccmbmaiit and human HSA was 
identicaL Inhibition of patatin &qptcssian using tiie antisense technology did not inqnnove the 
amount of rHSA. Average e^qpiession level among 29 transgenic plants was 0.032% of total 
soluble protein, with a TnaTtminn expression of 0.1%. 

5 

CMoropIast transformalion: We have also initiated tiansfonnation of the tobacco chlorpplast 
genome for hypero^ressLon of HSA. The codon conq>osition is ideal fi^ 
and no dianges in nacleotide sequences were necessaiy (see section d3). For all the constructs 
pLD vector was used (see description in sectioEn d.4). We destgned sevexal vectors to optimize 
10 HSA ejqxressira. All titese contain ATG as the &st amino addofllieniataie protein. 

1 - RBS-ATG-HSA: The first vector includes the gene that codes for the mature HSA plus an 
additional ATG as a translation initiatian codon. We included tiie ATG in one of die primers of 
the PGR, 5 nucleotides downstream of the chknoplast prefbied RBS sequence GQAGG, The 

15 cDNA sequence of the mature HSA (cloned in Dr. Mmgo-CastePs laboratory) was used as a 
tpiEHilfltfe The PGR product was cloned into PGR 2.1 vector, excised as an EcoRI-NotE fiagment 
and introduced into tiiB pLD vector. 

2- 5XrrRpsbA>ATCF>HSA: The 200 bp tobacco chloroplast DNA fiagmmt contaming the S' 
psbA UTR (untranslated regiioci; see section d3) was anplified using PGR and tobacco DNA as 
ten^late. The fiagment was cloned into PGR 2.1 vector, excised EcoRI-NcoI fragment was 
inserted at the Ncol site of the AT&-HSA and finally inserted into the pLD vector as an EcoRI- 
Notl fiagment downstream of the 16S iRNA promoter to enhance translation of the protein. 

3- BtORFl+2-ATG-HSA: ORFl and ORF2 of the Bt Cry2Aa2 open>n (see sectkm c and d3) 
were amplified in a PC^ using die complete c^jeron as a template. The fiagment was cloned into 
PGR 2.1 vector, excised as an BcoRI-BcoRY fiagment, ins^ted at EcoRV site with die ATG- 
HSA sequence and introduced into the pLD vector as an EcoRI-Notl fragment The ORFl and 
ORF2 were fused upstream of die ATG-HSA. 

Because of the sunilarity of piDtran syndiedc machinery (109), esqnession of aO 
chloroplast vectors was first tested in E.coh before their use in tobacco transfcHmaticm. Different 
levels of expression were obtained in K coli dqiending on the construct (Figure 10). Using the 
psbA 5' UTR and die ORFl and ORF2 of the cry2Aa2 operon, we obtained hig^ levels of 
expression than using only the RBS. We have observed in previous experiments that HSA in E. 
coh is CQnq>letely insoluble (as is shown in lef 14), probably due to an improper folding resulting 
fiom die absence of disulfide bonds. This is die reason why the protem is precqntated in the ^1 
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ifiereut polypeptide sizes were observed^ probably due to incomplete tianslatioiL 
Assuming that K coU and chlorc^last have similar protein synthesis madiinery, one could expect 
different levels of ^qnression in transgenic tobacco chloroplasts depending on the regulatory 
sequences, with the advantage that disulfide bonds are formed in chloroplasts (9). These diree 
5 vectors were bombarded into tobacco leaves via particle bombardment (110) and after 4 weeks 
small shoots appeared as a result of indqiendent transfimnation events. CSiaracterization of these 
tiansfoimants is in progress. Progress report will be provided when the panel meets. 

/ 

INTERFERON-oS 

10 Interfbron-otS has not been expressed yet as a commercial recombinant protein. The first 

attempt has been made recently in Prieto's laboratcxy. The IFN-a5 g»ie was doned and the 
sequence of tiie maluie fupotdn was inserted into the pETZS vector, that mcluded the ATG, 
histidine tag for puiificaticm and thrombin cleavage sequences. The lagged IFN-oS was purified 
first by binding to a nickel column and biotinylated thrombin was then used to climmate dtctag 

15 on IFN-a5. Biotinylated thrombin was removed from the preparation usmg streptavidin agarose. 
The eapression level was 5.6 micrograms per liter of broth culture and the reccnnbinant protein 
was active m antiviral activity similar or higher than commercial IFN-€i2 {}rAxoa Sobering 
Plouth). 

20 INSinjIN-IJ]aBGRO\VTHFACTOR-I^^ 

Recent studies in Itieto*s laboratory have demonstrated that treatment with low doses of 
IGF-I induced sigpificant unprovements in nutriticxial status (52X intestinal absorption (53-55), 
osteopenia (56), hypogonadism (57) and liver fimcdon (58) in lats with esqierimental liver 
cirrhosis. These data support that IGP-I deficiency plays a pa^genlc role hi sseveral systemic 
25 complications occurring in liver ctnhosis. Clinical studies are ui progress to ascertain the role of 
IGF4 in the management of cirrhotic patients. Unpublished studies indicate that IGF-I could also 
be used in pati^ts with articular d^eneiative disease (osteoarthritis). 

d. RESEARCH DESIGN AND METHODS 

30 d»l Evalnatioii of ehloroi^ast gene expression: A systematic iqiproach to identify and 
overccMne potential lunitations of foreign gene e)q>ression m chloroplasts of transgenic plants is 
essential. Information gained in this study should increase the utility of chloroplast 
tiansfixmatioD ^^stcan by scimtists interested in esqiressmg odier foreign protems. Therefore^ it 
is hnportant to ^strariatically analyze transcription, RNA abundance, KNA stabilify; rate of 
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)f the introduced HSA gene wiU be compaac&i wifh the hig^ e3q>iess]ng 
endogenous (Mowplast gpoes (rbcU psbA, 16S iKNA), using run on tzanscnption assays to 
determine if tiie 16S^NA promoter is operating as expected. Transg^c cMoroplast containing 
eachofthe three constnicts with different 5' regions (see preliminaiy studio 
5 investigated to test their transcription efficiency. Similarly, tiansgene RNA levels will be 
monitcied by ncnthems, dot blots and primo^ extension relative to endogenous ibcL» 16S iRKA 
or psbA. These results akiig with mn on Iraascrqition assays should provide valuable 
infarmatiQn of RNA stability, piocessing» etc. Widi our past eaq>edence in expression of several 
foTDign genes, KNA appears to be extremely stable based on nortfaem blot analysisL However, a 
10 systesnstic stndty would be vahaable to advance utility of Hsis system by other sdeotists. Most 
in^Kirtantiy, tiie efficient of translation will be tested in isolated chlorpplasts and conqmred 
with the hig^ translated chloroplast protein (psbA). Pulse diase experiments would help assess 
if translational pausing^ premature termination occurs. Evaluation of percent RNA loaded on 
polysomes or in constructs with or without 5*UTRs would help determine the cfEciency of tibe 
15. nbosome binding site and 5' stem-loop translational enhancers. Codon optimized genes (IGF-I, 
IPN) will also be conqiaied wifii unmodified genes to investigate the rate of translation, pausing 
and termination. In our recent experience, we observed a 200-fold difTeraice in accumulation of 
foreign proteins due to decreases in proteolysis ccmfenned by a putative di^ieronin (3). 
nierefbre, protems fiom constructs exinessing or not es^ressmg the putative chqteronin (with or 
20 widxiut ORPl+2) should provide valuable information on protein stability. Thus, all of this 
information will be used to inqyrove the next generation of dhloniplast vectois. The PI has 
extensLve 6q>eri«ic6 in analysis of chlon^last gBoe esqxress^^ 

d«2 Expression of the mature protein: HSA, Interferon and IGF-I are pr^-proteins tiiat need to 
be cleaved to secrete mature pK>tm& The codoo for translation imtiBtion is in the prese^ience. 
in chloroplasts, the necessity of esiqpressing die mature protein would introduce Has additional 
ammo acid in coding sequences, in onler to optimize ex^^ 

sequence of the mature piotetns beginning with an ATQ. Subseq;oent immunolo^ad assays in 
mice vnXi be done with those proteins to investigale if the extra-methionine can cause 
immunogBnic re^xmse or low Inoactivity. AHematrvely, we will develop systems to produce the 
matnre protein. These systems can include the syndiesis of a protein fused to a peptide that is 
cleaved intracellulary (processed) by chloroplast enzymes or the nse of chemical or enzymatic 
cleavage alEter partial purification of protdns fiom plant cells. 
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!S that are cleaved in chloroplast: Staub et aL (9) iqpoited chloroplast Gxpiession 
of hiiman somatotropin stniilar to the native human protein by using ubiqnitih fosions lhat were 
cleaved in the stroma by an ubiquitin protease. However, the processing efELdency ranged from 
30-S0% and die cleavage site was not accurate. In order to process chloroplast esqiressed proteins 
5 a peptide which is cleaved in the stroma is essential. The transit peptide sequence of die 
RuBisCo (ribolose l>S-bi^osphate carboxylase) small subimit would be an ideal choice. Tins 
transit pq>tide has been studied in dep& (11 1). RuBisQy is one of die p^ 
in cytoplasm and transported postianslatianaHy into the chloroplast in an energy dependent 
process. Hie transit peptide is proteolyticalfy removed \xgan. tmnspoit in the stroma by the 

10 stronoal processing peptidase (112X These are several sequences described for different q^ecies 
(1 13). A transit peptide consensus sequence for the RuBisCo small subunit of vascular plants is 
published by Keegstra et aL (114). The amino adds diat axe proximal to the C-teminal (41-59) 
are hi^y conserved in the higher plant transit sequences and belong to die damam which is 
involved in enzymatic cleavage (111). ThcKuBisCo small subunit transit peptide has been fused 

15 widi various maxker proteins (114»US), even widi animal proteins (116»117)» to target proteins 
to the chlaroplast Pncr to transformation studies, the cleavage efBciBncy and acconicy wiD be 
tested by m vUro translation of die fhsian proteuis and in crganeBo mspast studies using intact 
chlorc^lasts. Once we know the correct fosion sequence for produdr^ the mature protein, such 
sequence encoding the amino tenninal porticm of tobacco chloroplast transit peptide will be 

20 linlced widi the mature sequence of each protein. Codon composidon of the tobacco RuBisCo 
small subunit transit pqitide i^ipears to be compatible with chloroplast optimal translation (see 
secdon d3 and table 1 on page 30). Additirmal transit p^de sequences for targeting and 
cleavage in die chloroplast have been described (111). If we found that the RuBisCo small 
subunit transit pepdde is not suitable^ odier transit peptides widi cleavage in stroma will be 

25 studied The hxmen of diylakoids could be a gpod target because diylakoids are easy to purify. It 
is relatively easy to free hnnenal proteins etifaer by sonkation or with a vety low tritcm XlOO 
concentration. However, this may require insertion of additional amino add sequences for 
e£BcieotinqpK>rt(lll). 

30 Use of dmnical or enzymatk cleavage: The stralpgy of fusing a protein to a tag with aflSmify 
for a certain Ugand has been used extensively for mi»e dian a decade to enable afBnity 
punScadon of recombinant products (118-120). A vast number of cleavage mediods, both 
chemical and enzymatic^ have been investigated for this purpose (120). Chemical cleavage 
methods have low specificify and die relatively harsh cleavage conditions can result in chemical 

35 modifications of die rdeased products ( 120)« Some of the en^matic methods offer significantly 
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3 q>ecificities together with higji efEdency, e. g. H64A subtiHsin, IgA protease 
and &ctDr Xa (1 19»120), but these &aymss have tiie drawback of being quite expensive. 

Trypsin^ whidi cleaves C-tenninal of basic amino-acid residues, has been used for a long 
time to cleave fusion piotdns (14,121). Despite expected low specificity, trypsin has been shown 
5 to be usefiil for specific cleavage of fiision proteins, leaving basic residues within folded protein 
domains imcleavag^ (121). The nse of trypsin only requires that the N-teimimis of the mature 
protein be accessible to the protease and that die potential internal sites are protected in the 
native conformation. Trypsin has ^ additional advantage of beiog ine3q>eQsive and readily 
avai]abl& lii the case of HSA, vften it was eaqnessed in coli wifii 6 additional codons coding 

10 for a tiypsin cleavage site, HSA was processed successfoUy into the mature jxotein after 
treatment widi die protease. In addition, the >Merm2nal seqoence was faaaA to be unique and 
identical to the sequence of natural HSA, the conversion was 0Qizq>Iete and no degradation 
products were obsoved (14). This in vitro maturation is selective because correctly folded 
albumin is highly resistant to trypsin cleavage at inner sites (14). This system couldbc. tested fi>r 

15 chloroplasts HSA vectors using pnotdn expressed m^. coli, 

Staub et aL (9) demonstrated Oat die chlorpplast mednomne aminopeptidase is active and 
diey found 95% of removal of the fiist methicmine of an ATO-somatotropin protein that was 
expressed via the chloioplast gienome. There are several investig^ans that have shown a very 
strict pattern of deavag^ by this peptidase (122). Methionine is <nily removed when second 

20 residues are ^lydne, alanine^ serine, ^rsteine^ threonine, proline (»: valine, but if to third amino 
acid is proline the cleavage is inhibited. In the e^qiression of our three proteins we could use this 
approach to obtain the mature protein in the case of Interferon because the penultimate 
aminoadd is cystein followed by aq>artic acid. For HSA the second aminoacid is aspsactic acid 
and for IGF-I gl^dne but it is fbllowedbyprolme, so the cleavage may not be possible. 

25 For IGF-I expression, the use of the TBV protease (Giboo cat n 10127-017) would be 

ideal. The cleavage site diat is recognized for ^ protease is GtiHAsn^^ 
it cuts between Gln-CHy. This strata would aUow the release of the mature protein by 
incubation widi TEV protease leaving a glycine as the first amino add consistent with human 
mature IGF-I proteiiL 

30 In the £1 cott Inter&ron-DS caqncssion mediod developed in Dr. Frieto's laboratoiy (see 

section C), the purification system was based on 6 Histidine-tags that bind to a nidcel cotumn 
and biodnylated thrombin to eliminate die tag on IFN-DS. Thrombin recognizes Leu-Val-Pro- 
• Arg-Gly-Ser and cuts between Arg and Gly. This would leave two extra amino acids in the 
mature protein, but antiviral activify studies have been done showing that this protein is at least 

35 as active as commenaal IFN-D2 (m^ublished data m Dr. Prieto*s laboratory). 
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d3 Optimization of gene expresdon: We have reported that foreiga g^ies are expressed 
between 3% (cry2Aa2) and 47% {cry2Aa2 opm>n) in transgenic chloroplasts (3,85). Based on 
die outcome of die evaliiation of HSA chloroplast transgenic plants, several approaches will be 
5 nsed to enhance transladon of die recombinant proteins. In chloroplasts, transcripdonal 
regolation of gene expression is less important, although some modulations by light and 
devdopmentBl conditions aie observed (123). RNA stability ^eaxs to be one among the least 
ixoblems because of observatum of excessive accumulation of foreign transctqyts, at times 
l^S^66-fdd hig^ than tfae faiglily expcessing nuclear transgenic plants (124). Chloroplast gme 

1 0 eiqxession is xegulated 1o a iargje exSaasd at the po^-transcrqitional leveL F(Mr eacample, 5' UTRs 
are necessary for optimal translation of chloroplast mRNAs. Shine-Dalgamo (GGAGG) 
sequences as well as a stem-loop structure located S* adjacent to tfae SD sequoice are required 
for efiBcient translaticm. A recent ^udy has shown that insertion of the psbA 5' UTR 
downstream of the 16S cRNA promoter oihanccd translation of a foreign gene (GUS) hundred- 

15 fold (125a). Thcax;fm:c» die 20(H)p tobacco chloroplast DNA fiagmcnt (1680-1480) containing 
5* psbA UTR will be used. Thb PGR product will be insoted downstream of die 16S rRNA 
pcomoter to enhance translation of the recombinant {Hoteins. 

Yet another approach for enhancement of trandation would be to pptimi2e codon 
compositions. Since all tbe dxree jMXitdns are translated m E. colt (see section b), it would be 

20 reasonable to expect efiBcient expression in chloroplasts. However, optimizmg codon 
conqx)sitions to match the psbA gene could foither enhance the level of translation. Although 
rbcL (RuBisCX>) is the most abundant protein on earth, it is not translated as highly as the p^A 
gene due to the extr»nely hi^ turnover of die psbA graie product The psbA gene is under 
stronger selection for increased translation efficiency and is the most abundant thylakoid protein. 

25 In additicm, the oodon usage in higher plant chloroplasts is biased towards die NNC codon of 2- 
fold degenerate groiqjs (Le. ITC ovw TTT, GAC ova: GAT, CAC over CAT, AAC over AAT, 
ATCov^ATT.ATAetc.). llus is in additi(»i to a strong bias towards T at durd position of 4- 
fold degenerate groiq[K& There is also a oontB^ efifoct that should be taken into consideration 
while modi^ing specific codons. The 2-fold degenerate sites inomediatBly iqistream fiom a 

30 GNN codon do not show ttns bias towards NNC. (TTT QGA is preferred to TTC GOA vidule 
TTC OGT is preferred to TTT CGT, TTC AGT to TTT AGT and TTC TCT to TTT 
TCT)(125b,126). In addition, highly esqiressed chloroplast genes use GNN more fiequendy diat 
other genes. The web site ht^:/Avww^1cagi35a.or.ip/codon will be used to optimize codon 
composition by comparing diSerent species. Abundance of amino acids in chloroplasts and 

35 tRNA anticodons present in chloroplast will be taken into consideration. We also compared 
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: of all foreign genes tiiat had been expressed in transgenic chloroplasts in our 
laboratory widi the percentage of chloroplast eTqnression. We fonnd tiiat higher levels of A+T 
always correlated with higjh expression levels (see table 1 on page 30). It is also possible to 
modify chloroplast protease recognition sites while modifying codons, without affecting their 
5 biological functions. 

The study of Ifae sequences of HSA, IGF-I and Interfenm-DS was dona The HSA 
sequence showed 57% of A+T cantent and 40% of the total codons matched wi& the psbA most 
translated oodons. Acooidu^ to the data of table 1, we should esqtect good chloroplast 
aqntession of the HSA ^ne wifliout any modiOcatioiis in its codon composition. IFN-05 has 

10 54%of A+T c(mtBxit and 40% of matching with pshA codons. The conqx^^ 

but this protein is small (166 amino acids) and it wonld be easy to optim^ the sequence to 
adueve A+T levels close to 65%^ Finally, tiie analysis of die IGF-I sequence showed that the 
A+T content was 40% and only 20% of die oodons are die mo^ translated in psb A. Therefore^ 
this gpnc needs to be optimized, OptirmzatiQn of these two genes will be done using a novel PGR 

15 approach (127,128) wbich has been successfiilly used in our laboratory to optimize codon 
composition of odier human proteins. 

d4 Vector constrnctions: For all the constructs pLD vector wiH be used. This vector was 
developed in this laboratory for chloroplast transformation. It contains the 16S xRKA promoter 

20 (Prm) driving the sdectable marker gene oadA (aminoglycoside adenyl transferase conferring 
resistance to q;>ectinomycin) followed by the psbA 3* region (die tenninator fiom a gene coding 
for photosyst^n II reaction center components) from the tobacco chloroplast genome. The pLD 
vector is a universal chloroplast e^qxression ^te^ration vector and can be used to transform 
ddoroplast genomes of several odier plant species (73,86) because these flanking sequences are 

25 hi^bly conserved among hi^3» plants. The universal vector uses tmA and tml genes 
(chknoplast transfer RNAs coding for Ahmtne and Isolendne) fiom die inverted repeat region of 
die tobacco chloroplast genome as flanlring scqucaces for homologous recombination. Because 
die universal vector integrates foieiga genes widiin the inverts diloroplast 
genome^ it sboM double die copy number of die transgene (fiom 5000 to 1 0,000 cc^ies per cdl 

30 in tobacco). FmtlKimore^ it has beimdanoDstiatedtl»thDmgplas^ 

round of selection in tobacco probaUy because of the presence of a chloroplast origm of 
replication within the flanking sequence in the universal vector (tii^by providing more 
tenqtbtes for integration). Because of these and sevraral other reasons, foreign gene expression 
was shown to be inuchlugher when die univosal vector was used instead of the tobacco specific 

35 vector ^8). 
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We will design the foUonrai^ vectofs tD punficaitoik and production 

of pioteins with the same ainino add oanqx)6itioa as in 

5 a) In Older to optimize expression we will increase translation using the pshA S^UTR (see 
section d.3) and opthnizing ^ codon conq>osition for protein ex|s:essioQ in chloioplasts 
accxxrding to cnteria discussed in section d3. The 2(X> bp t^ 

containing 5* pshA UIR will be an^ilified by PGR usmg tobacco dilorc^la^ DNA as 
tBfnplate. lliis fiagment will be cloned dbecAy in the pLD vector nmltqile cloning site 
10 (EeoRl-NcoI) downstream of the promoter and the aadA gene. Tbo cloned sequence will be 
exact}/ ^ same as in die psbA gene. 

b) For enhancing protein stability and &cilitaling purification^ die cry2Aa2 BadUus 
ffttdringiensis opmm derived putative di^ierDnin will be used. E:iq>iession of the oylAal 

15 operon in chloroplasts provides a model ^stem for hyper-expression of foreign proteins 
(46% of total soluble protein) in a folded configuration enhancing their stability and 
fecilitnting pmificalion (3). Ibis justifies inclusion of the putative chqienmin fiom the 
€ry2Aa2 openxi in one of the newly designed constructs. In this region tiiere are two open 
reading ftanies(ORFI and ORF2) and a libosoxnal binding site (rfos). This sequence contaiiis 

20 elements necessary fi>r Qy2Aa2 ciystallization which may he^ to crystallize the HS A, IGF-I 
and IFN-0 protems aiding in the sub^uent purification. Successfiil ciystallization of otiier 
proteins using tins putative diaperomn has been demonstrated (94). We will amplify the 
ORFl and OKF2 of the Bt Cfy2Aa2 operon by PGR using the complete openm as tenq)late. 
The fiagmrat will be cloned into a PGR 2 . 1 vector and excised as an EcoRI-EcoRV product 

25 This fiagment will be cloned direct^ into die pLD vector nniltq>le cloning site (BcoRI- 
EcoRV) downstream of die promoter and die aadA ^ne. 

c) To obtain proteins with the same acid conqiosition as mature lumnan proteins, we will 
first fiise all three genes (codon optimized and native sequoice) widi the RuBisCo small 

30 subunit transit pqitide. Also other constnictions will be done to allow cleavage of the protein 
after isolation fix>m ddorophst (see section d2). Ihese strategies would also allow affinity 
pnrification of the proteins. 



35 



The first set of constructs will inchide die sequence of each protein beginning with an ATG, 
mtrodnced by PGR using prim^ Once we achieve optimal expression levels, and if the ATG is 
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>roblem (detennined fay mice immunological as^ys), processing to get the mature 
protein will be addressed. The first attenq)t will be the use of the RoBisCo small subimit transit 
peptide. This transit peptide will be amplified by PGR using tobacco DNA as teiiq>late and 
cloned into the PGR 2.1 vector. All genes will be fiised with the transit peptide using a Mlul 
5 restriction site that will be introduced in tbe PGR primers for amplification of die transit p^tide 
and genes coding for tliree proteins. The gene fiisions wiU be inserted into the pLD vectors 
downstream of fte S*UTR or ORFl+2 using the restriction sites Ncol and EcoRV re^)ectively. If 
use of tags or pfotease seqamoes is necessaiy, such sequences will be inliodaced by ckssism'ns 
primers including diese sequences and amplifying die gene with POL After completing vector 
10 constructions^ all the vectors will be sequenced to confirm correct nucleotide sequence and in 
fiame fusiorL DNA sequencing will be done using a Peridn Elmer ABI prism 373 DNA 
sequenc i ng system, 

CDndm 

r 




S*utr|ntoA 



^ natNeorcodonoptlnilzBd Pw rnstOonb rn 



Because of tlie similarity of protein ^nttietic madixnery (1Q9)» agression of all 
ddoroplast vectois will be first tested in iSLco& before thrir use in tobacco txansfonnatimL For 
30 Escherichia colt expression XLrl Blue strain will be used E. coli will be transformed by 
standard CaCli transformation procedures and grown in TB culture media. Purificatioii, 
biological and immunogBuic assays will be done using J5L coU expressed proteins. 

Bombardment, Regeneration and Characterization of Chloroplast Trani^enic Plants: 
35 Tobacco {NtcoHana tabacum var. Petit Havana) plants will be grown asqyticalfy by germination 
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SO mediunL Tins medhim contains MS salts (4.3 gftiterX B5 vilamm mixtuie 
(myo-inositol, 100 rngfUter, thiamm&-HCl, 10 mg/Uter; rnqotinic acid, 1 mg^liter; iryndoxme- 
HCl, 1 mg/liter), suonse (30 goiter) and phytagar (6 g^ter) at pH 5.8. Fully expanded, daik 
green leaves of about two month old plants win be used for bombaidment 
5 Leaves will be placed abaxial side up on a Whatman No. 1 filter p^>er laying on the 

RMOP medium (79) in standard petn plates (100x15 mm) for bombaidment Gold (0.6 ^m) 
micropiqiectiles will be coated wifli plasmid DNA (chlorc^Iast vecfois) and bombanhnraits will 
be canied out witti the biolistic device FDSIOOO/Hie (Bio-Rad) as described by Danidl (110). 
Fbllowipg bombaidment^ petri plates wiO be sealed widi paia£ilm and incubated at 24*^C under 
10 12hpliotoperiod Two days aficrbombaidmcDt, leaves wiU be dic^iped into sm 
ntm^ in size and placed on the sdection mediam CEtMOP contaiim 

dthydnochloride) with abaxial side toudiang lim medium in deep (100x25 mm) petri plates (-AO 
pieces per plate). The reg^erated spectinomycin resistant shoots will be chopped into small 
pieces (~2mm^ and subcloned into fiesh de^ petri plates ('^5 pieces per plate) containing the 
15 same selection medium. Resistant shoots from the second culture cycle will be transfened to the 
rooting medium (MSO medium siq>plemented with JBA, 1 mg/liter and spectinomycin 
dihydrochImde> 500 mgHitCT). Rooted plants will be transferred to soil and grown at 26X 
under 1 6 hour photoperiod oxiditions fox farther analysis. 

20 PCR analfsis of putative transformants: FCR will be done using DNA isolated from control 
and transgenic plants in order to distingoi^ a) true chiorqplast transformants from mutants and 
b) chloiQplast transformants from nuclear transformants. Primm for testing tiie presence of the 
aadA gwe (diat confers spectinomycin resistance) in transgenic plants will be landed on die 
aadA codmg sequence and 16S rRNA geaie. in order to test chloroplast integration of the genes^ 

25 cxie primer will land on the aadA gene vainle another will land on the native chknoplast genome. 
No PGR product will be obtained with nuclear transg^c plants using this set of pximecsw The 
primer set wfll be used to test integration of die entire gone cassette witfaoi^ 
(X'kK>ping out during hornologqiosiecombiiiatLQXL Similar^trat^ has been used successfully by 
us to con&m diloxoplast integration of foiuigp gjBnes (3^85-88). This soeenir^ is essostial to 

30 diminate mutants and nudear transformants. In order to conduct PGR analyses in transgenic 
plants^ total DNA from unbombaided and transgenic plants will be isolated as described by 
Edwaids et aL (129). Chloniplast transgenic plants containing die desued gene will be moved to 
seccxid round of selection in order to achieve homoplasmy. 

35 Southern Analysis for homoplasnqr and copy number: Southern blots will be done to 
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copy number of the mtioduced foieign gene per ceil as well as to test 
homoplasmy. There are several fhousand copies of the chloroplast genome present in each plant 
celL Thiocfare^ when foreign genes are inserted into the chloroplast genome, it is possible that 
some of the chloroplast genomes have foreign genes integrated while others remain as tbe wild 
5 type (heteroplasmy). Therefore, in order to ensure that only the transformed genome racists in 
cdls of transgenic plants (homoplasmy), tiie selection jHOcess will be continued. In order to 
confirm that the wild type genome does not exist at tibe end of the selection cycle, total DNA 
firom transgenic plants should be probed witii the ddoroplast bordor (flanking) sequences (the 
tml-tmA fragment). If wild type g^ncxnes are presoit (heten^lasmy), the native fiagpient size 

1 0 will be observed along wifii transformed genomes. I^esence of a large fiagment (doe to insertion 
of foreign geiies within die flanking sequences) and absence of tlie native small fiagment should 
confirm hcmioplasnQr (S5,86,88). 

The copy nmnber of the integrated g^e wiU be determined by establishing homoplasmy 
for the transgenic chloicrplast genome. Tobacco chloroplasts contain 5000-10,000 copies of their 

15 genome per cell (86). If only a fiaction of the genomes are actually trans^Drmed, die copy 
nmnber, by de&olt, must be less than 10»000. By establishing that in the transgenics the gene 
inserted transfonnedgpnome is flie only one present, one could estaUish that to 
5000-1 0,000 per celL Tins is nsualfy done by digestmg die total DNA with a suitable restriction 
eoB^me and probiug widi die flanking sequences that enable homologous recombination into die 

20 chlorDplast genome. The native fragment present m the contml should be absent in die 
transgenics. The absm^e of native firagment pioves that only the transgenic chloroplast genome 
is present in die cell and there is no native, untransformed, chloroplast graiome, without die 
foreign gene present This e^ablishes the homopiasmic nature of our transfonnants, 
^multaneousfy providing us with an estimate of 5000-10,000 oopies of the foieigp genes per 

25 celL 

Northam Analsfds for transcript stability: Nordiem blots will be done to test tiie efficiency of 
transcription of the genes. TGtBiIQ<CA.winbeisolatedfifoml50mgof firozaenleavesbyusii^ 
'^easyllant Total RNAIsoladon]ar(Q]agenIn^^ RNA (10-40 pg) will 

30 be denatured by fiirmaldehyde treatment, sq»rated on a 1,2% agarose gel in tibe presence of 
formaldehyde and transferred to a nilxocettulose membrane (MSI) as described in Sambrook et 
al. (130). Probe DNA (proinsulin gene coding region) will be labeled by the random-primed . 
method (Promega) with ^kICTP isotope. The blot will be pre-hybridized, hybridized and 
washed as described above for sou&ein blot analysis. Transcript levels wiU be quantified by the 

35 Molecular Analyst Pjrogcam using the GS-700 Lnaging Densitomet^ (Bio-Rad, Hercules, CA). 
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Expression and qnantificatian of the total protein expressed in cUoropIast: Chloioplasl 
eTqjression assays will be done far each protein by Western Blot Recombinant protein levels in 
transgenic plants will be detennined using quantitative FFJSA assays. A standard curve will be 
5 generated nsing known concentrations and serial dilutions of recombinant and native proteins. 
Different tissues wiU be analyzed using young, mature and old leaves against these primary 
antibodtes: goat anti-HSA (Nordic Inmmnology), anti-IGF-I and anti-foterferon alpha (Sigma). 
Bound IgQwiU be measured usinghoirsecadishpertmdase^abel]^ 

Inheritance of Introduced For^;n Genes: While it is imlikely &at introduced DNA would 

10 move fiom the chloroplast genome to nuclear gen(Mne» it is possible diat the g^ could get 
integrated in the nuclear gsnome during bombardment and letnain undetected in Southern 
analysis. TheieKxie, in initial tobacco transfoimaiitB» some will be allowed to self-poUinate, 
¥^Kereas ottiers will be used in recqparocal crosses with control tobacco (transgenics as female 
accqrters and poUen donon^ teslmg fixr maternal inbentance). Eforvested seeds (Tl) will be 

15 genninated on media csontdning spectimmiycin. Achievement of homoplasniy and mode of 
inheritance can be classified by looking at genninati(Mi results. Homoplasmy should be indicated 
by totally green seedlings (86) wiuic heteroplasmy is displayed by varkgated leaves (lade of 
pigmentation, 83). Lade of variation in chlorophyll pigmentation among progeny ^ould also 
undecsccHe the absoice of position efifect, an artifact of nuclear transformation. Maternal 

20 inheritance will be demonstrated by sole transmission of introduced genes via seed g^emted on 
transgenic plants, r^ardless of pollen source (green seedling? on sdective media). When 
transgenic pollen is used for pollination of control plants, resultant progBoy wofuld not contain 
resistance to chemical in sdectcve media (will sppcar bleached; 83). Molecular analyses wiU 
confirm transmissicm and expression of introduced gcnes» and T2 seed will be gpnerntftd ficm 

25 diose confirmed plants by die analyses described above. 

d»6 Purification methods: The standard method of purification will employ dassical 
biochemical techniques with die crystallized proteins inside the chloroplast In tins case, the 
hornogenates win be passed throu^iniraclotti to remove cdlddnis^C^^ 10,000 xg 

30 would pellet all foreign proteins 0). Proteins will be solubilized using pH, tenq>eiatQre gradient, 
etc. This is possible if the ORFl and 2 of the cry2Aa2 operon (see section c) can fold and 
crystallize the recombinant proteins as expected. If fbm is no crystal formation, other 
purification mettiods will be done (classical fatodsemistzy techniques and afBnity columiK with 
protease cleavagie). 
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HSA: Albumin is ^ically administered in tens of gram quantities. At a purity level of 99.999% 
(a level considered su^deat for other recombinant protein preparationsX recombinant B5A 
(rHSA) inqmnties on the order of one mg will still be injected into patients. So impurities ftom 
5 the host oig^nism must be reduced to a minimum. Fmthennore, purified iHSA must be identical 
to hmnan HSA. Deqnte these stringent lequiremCTtSy pmification costs must be kept low. To 
purify the HSA detained by gene manipntoioD, it is not ^ipropriate to apply the conventioztal 
processes for purifying HSA (mginating in pla^ as such. Tim is because die impurities to be 
eKminated frrnn yfTSA cmwpletely differ fmm those cftntalned in. Aft W5gA ftrigiymtrng in ployma 
10 Namdy»iHSAi5cosdaininatedwitfa,fiNr 

HSA, proteins originating in the host cells» polywcchnrides, etc. En particolar» it is necessazy to 
suf^ently eliminate components originating in the host cells^ siu^ 
living organisms jnchiding hmnan and can cause the problem of antigpnicify. 

15 Li plants two different methods of HSA purification have been done at laboratory scale. 

Sijmons et aL (23) tranafixmed potato and tobacco plants wt&i Agrohacterhm tum^adens. For 
die extraction and pniificaticii of HSA, lOOOgofstmandleaftissQewBshoniogNUzedinlOOO 
ml cold FBS, 0.6% PVP, 0.1 mM FMSF and 1 mM EDTA. Tbe lK>mogBnatB was clarified by 
filtration, ceotrifiiged and the siqiematant incubated jfor 4 h with 1.5 ml polyclonal autiHSA 

20 coupled to Reactigel spheres (Pierce C3iem) in the presence of 0 J% Tween 80. The camples 
HSA-anti HSA-Reactigel was collected and washed with 5 ml 0.5% Tween 80 in PBS. HSA was 
desorbed fi^om the reactigel con^lex with 2.5 ml of 0.1 M ^ycine pH 2.5» 10% dioxane» 
immediately followed by a bafif^ exchange witii Sephadex G25 to 50 mM Tris pH 8. The 
san^le was then loaded on a HR5/5 MonoQ anion exchange cohmm (Pharmacia) and ehited 

25 with a linear NaQ giradient (0-350 mM NaQ in 50 mM Tris 8 in 20 min at Iml^mn). 
E^ractions containing the cGnoentrated HSA (at 290 mM NaO) were lyqphilized and ^Hed to a 
HR 10/30 Sqpliaiose 6 ccdomn (Pharmacia) in PBS at 03 ml/min. However, this me&od uses 
afiSnity cohmms (po^kmal anti-HSA) that £te very esqiensrve to scale-up. Also the protein is 
released ficom the cohmm with 0.1M gtydne pH 25 diat will most probably, denature &e 

30 protein. Thraxfbr^ tins method wiH be suitably modified. 

The second method is for HSA extraction and pmification firom potato tubers (Dr. 
Mingo-Casters laboratory). After grinding the tuber in phosphate buffer pH lA (1 mg'2ml), the 
hompgenate is filtered in miraclodi and ceotrifiiged at 14.000 rpm 15 minutes. After this st^ 
35 another filtration of tiie supernatant in 0.45 Dm filters is necessary. Then, chromatography of 
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i in FPLC using a DEA£ S^^harose Fast Flow coliinm (Am^rsham) is icquiied. 
Fracdons recovered aie passed tiarough an afOnity cohinm (Blue Sqihaiose &st flow Amersham) 
resulting in a product of hi^ purity. HSA purification based on both melhods will be 
iavesdgatBd. 

5 

IGF-1: An earlier attempts to produce IGF-I in K coli or Saccharomyces ceresasiae have 
resulted in misfolded proteins. This has made it necessary to perfonn additional ih Wih? refolding 
or extensive Bcpass&m techniques in order to recover the native and biological fozm of &e 
molecule. Da addition, IGF-I has been draaiQDStEated to possess an intrinsic tiieanodynaxnic 

10 folding pioblran with legard to quantitatively foklii^ into a native disidCde-bonded 
conformation in vitro (131). Sanmelsson el aL (131) and Joly et al. (132) co-e?qnes8ed IGF-I 
with specific proteins of £L coH that sigmficantly inq>ioved the relative yields of correctly folded 
protein and consequent]^ fecilitating purification* Samuelsson et al. (132) fiised the protein to 
afBnify ta^ based on either the IgO-binding domain (Z) fiom Staph}dococcal protein A or &e 

15 two senmi albumin domains (ABP) fiom Stccptococcal protdn O (134). The fusion protein 
concqpt allows the IGF-I molecules to be Inirified by IgG or HSA affinity chromatogr^y. We 
also couM use this Z tags for protein purification includmg the doub^ aureus 
protein and a seqnoice recognized by TEV protease (see section d2). The fiision protein will be 
incubated with an ](gjG column where binding via the Z domain is ^cpected to occur. Z domain- 

20 IgO interaction is very specific and has high afGnity, so contaminant proteins can be easOy 
washed off the column. Lttcubation of the coluam with TBV protease will elute mature IGF-I 
fiom the cohmm. TEV protease ^ produced in bacteria in large quantities fiised to a 6 histidine 
tag that is used for TEV purificatton. Tins tag can be also used to sq)ar8te IGF-I fiom 
contaminant TEV protease. The method could be tested easily in E. coU before doing tobacco 

IFN-D:Intiie£L cofi esqyiesdan method devdoped in Dr. Ihrieto's laboratory 
the porificatian system was based on using 6 Histidme4ags that bind to a nid»I cohmm and 
tnothrylated thnnnbin to elmdnate the tag on IFN-D5 (see section d^). We propose usfaig tiie 
30 same method as a first attempt fiir purification. This method could be tested in K call expressed 
proteins. 

d«7 Characterization of the recombinant proteins: For die safe use of recombinant proteins as 
a replacement in any of the current q>plications» these proteins nmst be structuially equivalent 
35 and must not contain abnormal host-derived modifications. To confinn compliance with these 
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1 compare haman and lecombiiiaiit prateios usmg fbo camaaUfy hig^ily sensitive 
and higbl^ lesotving techniques expected , by tibe legnlatoiy authonties to chaiacteiize 
recombinant products (1 35). 

5 I- Amino add analysis: A mino aiad analysis to confim 

fidlowing ofMine vapour phase hydrolysis using ABI 420A amino add derivatKaer with an 
on line 130A i^xak}dMocaifaamyl^amino add analyze (AppHed Biosystems/ABQ. N> 
terminal sequence analysis vnU be perfomed fay Edmandegradadon using ABI 477Aprotem 
sequencer with an on-line 120A phenyUiiohydantdn-aniino acid analyzer. Automated C- 

0 (ffiTninal sequence analysis will use a Hewiett-Pockaid G1009A protdn sequencer. To 
- confirm the C-terminal sequence to a greater mnnber of residues, fte C-tenmnal tiyptic 
peptide win be isolated fiom tiyptic dig^ by reveis&phase HPLC. 

2- Protein folding and disulfide bridges formation: Western blots with reducing and non- 
reducing gels will be done to chedc protdn folding. PAOE to visualize small pxotdns will be 

5 done in the presence of tddne. Ftotem standards (Sigma) will be loaded to oo mp ai e the 
mobiUfy of the recombinant proteins. PAGE will be padbrmed on PhastGels (Phamiada 
Biotech). Proteins will be blotted and then probed with goat anti-HSA, intei&ron a^pha and 
IGF-I polydonal antibodies. Bound IgQ will be detected wib horseradish perooddase- 
labelled anti gpat 1^ and visualized on X-iay film using BCL detection reagents 

0 (Amersham). 

3- Trvptic mapping : To confirm ttie presence of chloroplast expressed proteins with disulfide 
linkage identical to native human proteins^ the saiiq>ies will subjected to tryptic digestion 
followed by peptide mass mappmg using matxix'-assisted laser desoiption ionization n^ss 
spectrometry (MALDI-MS). San^les will be reduced widi di^othrdtol, aU^lated widi 

5 iodoacetamide and tiien digested with tiypsin oonqHisiiig tinee additions of 1:100 
enzymc/substtato over 48h at 37*^CI SiAsequently tiyptic pqptides wiD be sqpaxated by 
reveis&phase HPLC cm a Vydac C18 cohnmu 

4- Mass analysis: B lectrospray mass q)ectrometcy (ESMS) will be pofomoed using a VG 
Quattro dectro^iay mass spectrometer. Samples will be desalted prior to analysis by 

D reverse-phase HPLC ushig an acetosutrile gradient contaiziing 

5- CD : Spectra will be measured in a nitzpgjea atmosphere using a Jasco J600 
^^ectropolarimeter. 

6- Chromatographic technicfues: F or HSA, analytical gel-^>eimeation HPLC will be performed 
using a TSKG3000 SWxi cohmm. Pr^»rative gel permeation chrcunatogr^hy of HSA will 

5 be perfonned usmg a Sq^hacryl S200 HR cohmm. The monomer fiaction, identified by 
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at 280 nni» will be dialyzed and teconcentcated to its starting concentration. For 
IGF-I» ^e reversed-phase chiomatograpliy the SMART system (Hiannacia Biotech) will be 
used with the mRPC C2/18 SC 2.1/10 cohunn. 

7- Viscositv: This is a classical assay for recombinant HSA. Viscosity is a characteristic of 
5 proteins related directly to their size, shape;> and conformation. The viscosities of HSA and 

lecomhmant HSA will be measured at 100 mg. Ml-1 in CIS M NaC3 nsing a U-tube 
viscosinieter (^0 ^pe, Poulton^ SdLfe and Lee 

8- GlvcQSvtetiQn: Cfakaxylast proteins are not known to be (Biycosvlated> However tfage are no 
pubficatioEns to oonfiim or refute iMs assunq^on. Therefoie gjtyco^lation will be measured 

10 using a scaled-^q^versioaoflfaemeEthod of Ahmed and 

d»8 Biological Assays: 

Since HSA does not have enzymatic activity, it is not possible to run biologtcal assays. 
Three different techniqoes will be used to diode IGF-I functionality. AQ of them arc based on die 

15 proliferation of IGF-I re^xmdingcells. First radioactive diymidine uptake will be mcasored in 
3T3 fibroblasts, that express IGF-I receptor, as an estimate of DNA synthesis. Alsc^ a Imman 
megakaryoblastic cell hne^ HU-S, will be used As HU-3 grows tn snspeaisiim, changes in cell 
number and stimulation of glucose uptake induced by IGF-I will be assayed usmg AlamarBhie 
or ghicose consunqytion, respectively. AlamarBlue (Accumed International, WestialDcOH) is 

20 reduced by mitochondnal enzyme activity. The reduced form of the reagent is fluorescent and 
can be quantitatively detected, with an excitation of S30 nm and an emission of 590 nm. 
AlamarBlue will be added to the cells for 24 boms after 2 days induction witii different ^ses of 
IGF -I and in tiie absence of serum. Glucose consumption by HU-B cells will be measured using a 
colorimetric ghicose oxidase procedure provided by Sigma. HU-3 cells will be mcobated in the 

25 absence of serum with different doses of IGF-L Ghicose wiU be added Sat 8 houxs and glucose 
concentration will be measured in the supernatant All flnee methods to measure IGF-I 
functionality are precise^ accurate and dose dependent, witti a linear range betweoi 0.5 and SO 
ngM(137). 

30 The method to determine IFN activity will be based on tiidr anti-viral properties. Thb 

procedure measures tiie ability of IFN to protect HeLa cells against the cytopathic effect of 
encephalomyocarditis virus (EMQ. The assay will be performed in 96-well microtitre plate. 
First, IfeLa cells will be seeded in the wells and allowed to grow to confhiency. Then, the 
medium will be removed, replaced witii medium containing IFN dilutions and incubated Sac 24 

35 hours. EMC virus will be added and 24 hours later tiie cytopathic effect will be measured. For 



wo 01/72959- 



272 



PCT/US0iya6288- 



on will be removed, wells wiU be nnsed two times with PBS and stained with 
methyl violet dye sohitton. Hie optical densi^ will be read at 540 nm. Hie values of optical 
density are proportional to Ae antiviral activity of BPN (138). Spedfic activi^ will be 
detennined mfJi reference to standard IFN-D (code $27576) obtained firom NIBSC. 

5 

d.9Aiiiiiial testing and Pre-Oinical Trials: 

If alhiimin can be produced at adequate levels in tobacco and the physicochendcal 
properties of tiie pnxioct cmespand to those of the natnral protein, toxicology studies need to be 

10 done in mice. To avoid mice response to the human protem, transgjeoic noice carrying HSA 
genomic sequences will be used (139). After injecticm of none, 1, 10, 50 and 100 mg of purified 
recombinant protem, classical toxicology studies will be carried out (body wei^ and food 
intake, animal bdiavior, piloerection, etc). Pharmaceutical companies will be contacted for 
flutter toxicology studies and clinical development of the product Albumin could be tested for 

15 blood volume replacement after paracentesis to diminafte the flmd ftom the pedtoneal cavity in 
patients widi fiver dniiosis. U has been shown that albnnun infiisian after tiiis maneuver is 
essoottial to preserve effecdve drculatory vohime and renal function (140). 

ICSB'l and IFND will be tested fat biological effects in vi\H> in animal models. Dr. 

20 Prieto*s laboratcny has extensive experience working with woodchucks (mamota monax) 
infected with the woodchudc hepatitis virus (WHV), widely consid^ed as the best animal model 
of hq>atitis B virus infection (141). Prelhninary studies performed in Dr. Prieto's laboratory have 
shown a Significant increase in 5' ollgoadenylate syntiiase RNA levels by real time polymerase 
chain reaction O^CR) in woodchuck perq>h6ral blood tnononuelear cells iqxm incubation with 

25 human IFND5, a proof of bidogical activity of the human OFNQS in woodchuck cells. For in 
vhv studies, a total of 7 woodcinidks dirooically infec 

WHV-DNA positive in serum) will be used: 5 animals wiU be iiyected subcutaneouEdy with 
500.000 units of human IFN05 (die activity of human IFND5 will be d^ermmed as described 
previously) three times a week ftnr 4 months; the remaining two woodchucks will be iiyected 

30 with placebo and used as controls. Follow-up will include weekly serological (WHV sur&ce 
antigen and anti-WHV surface antibodies by HLISA) and vholpgical (WHV DNA in serum by 
real time quantitative PGR) as well as monthly immunological (T-helper responses against WHV 
surl^ and WHV core antigens measured by interleukin 2 production from PBMC incubated 
witii tiiose proteins) studies^ Finally, basal and end of treatment Uv^ biopsies will be per&mned 

35 to sooee fiver mflamniation and intrahepatic WHV-DNA levels. The final goal of tieatmait will 
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f vital iq;)lication by WHV-DNA in serum, widi secondary end points being 
histological improvement and decrease in intrahepatic WHV-DNA levels. If IFND5 proves to 
exert antiviral activity against WQV in the woodchuck, a search will be conducted £ar possible 
industrial partneis interested in the clinical develop 

5 

For IGF- i , the vivo ttiei^>eiitic ^cacy will be tested in animals in srtuaticHis of IGF-I 
deficiency snch as liver dnbosis in rats. Dr. Fdeto's lab has publidied several reports (56-58) 
showing that recombinant fauman IGF-I has marked beneficial effects in increasing bone and 
muscle mass» inqmyving liver fimction and conecdng hypogonadiam. Bdefty^ fte mrhigtinn 

10 protocol will be as follows: liver cirxhoaia will be induced in rats by tnlmliiHftn of carbon 
tetrachloride twice a week for II weeks» with a progressively iricieasingesqmsuretirnefiGml to 
5 minutes per gassing session. After the 11^ wceky animals will continue receiving CCI4 once a 
wedc(3 rmnutes per inhalation) to oxtnplete 30 wedcs of CXn4 admirdstration. During the ^diole 
induction pmod, phenobaifaital (400 mg/L) will be added to rfriTilring water. To test &^e 

15 thcc^>eutic efficacy of tofaacco-dmycd I€9P-]^ cirrhotic lats will receive 2 fig^lOO g bod|y 
wei^htt/day of this compound m two divided doses» during tiie last 21 days of tiie induction 
protocol (weeks 28, 29, and 30). On d^ 22, animals will be sacrificed and liver and blood 
sainples will be collected. The results will be conq>aied to those obtained in canbotic animals 
receiving placebo instead of tobacco-derived ICH^-I, and to healthy control rats.. As in the case of 

20 IPNQ, if plant-derived IGF-I (in addition to exerting characteristic biological effects in vitro) 
reproduces the effects of ^ commerciai recombinant IGF-I in vivo. Pharmaceutical companies 
will be contacted for further preclinical and clinical development TGF-I can be tested in patients 
with Hver cirrhosis arid poor nutritional status. 

25 Import of certain passages from S^tain inactivated spelling and grammar check 

fimction for this file. Every effort was made to do spelling and grammar checks manuaiiy. 
Investigate apologize to reviewers for inadvertent omissions. 

30 

Tentative Proposed Schedule 



Year I: 



wo 01/72959 



274 



PCT/USOl/06288 



combinaot DNA vectois for eaihanced traoslatioii of all th6anq>eutic piotciiis via 
chlon^last genomes of tobacco 

b) Test piotempmification and piocessiiigiiskgdiloroplastsvec^ 

c) Obtain transgenic tobacco plants using Ihetzansfonnation vectois 
5 d) Assay transgenic expression of 1herq)aiticpiDteins in 

and faiocfaemcal methods 



Yearn: 

a) Develc^ recombinant DNA. vectois for enhanced translation of all dierq>eiitic proteins via 
10 di]on)plastgea<miesoftobacco for efficient processing 

b) Test protein poiificatioo and ]m>cessingnsingchloioplastsvec1oxs in £ 

c) Obtain tran.<^em'c tobacco plants oang the transformation vectois 

d) Assay transg^c esqpression of processed tiser^peutic proteins in chloroplast? using 
molecular and biochemical mcttiods 

15 

Yearlll: 

a) Employ existing inetfaodsofpurification from transgenic leaves develop new approac^ffis 

b) Analyze genetic conqK>sxtiGn of tranggenic plants (Miendefian or matemal inheritance) 

d) Laig^ scale ponfication of thecapeudc proteins ftom green 
20 cQiiq[iacison of cmient purification medu)ds with ^ 

e) Aniinal testing, pro-diiiical trials 

Year IV 

a) Refolding and characterization/conqmiison (yield and purity) of therapeutic proteins 
25 produced in ^'.co&'tfT- yeast wi&tEan^Bnic tobacco 

b) AmmaltestirigypreTclhiicaltdals 

c) Continue to characterize subsequent transgBDic generation 



wo 01/72959 



275 



PCT/USOl/06288' 



1 GM 63879- 01 

EXPRESSION OF HUMAN THERAPEUTIC PROTBNS IN TRANSGENIC TOBACCO 
CHLOROPLASTS - Henry Daniel! 

In the near future, demand for existing biopharmaceuticals as weO as new therapeutic proteins is 
expected to rise considerabV- Therefore, It Is important to evaluate alternative transgenic production 
systems and ensure avaiiabiBty of safe tNopharmaceuticals In a cost effective manner. Chloroplast 
genetic engineering promises to be one of the best available technique since foreign gene expression 
up to 46% of total soluisle protein h^ been demonstrated rec^itiy (study from our lab featured on the 
cover of Nahire Biotechnology, January 2001). The sp^iRc aim of this proposal is to optimize 
producGon of therapeutic proteins such as human semm albumin (HSA), insulin like growth factor 
(IGF4) and interferon a (iFNa) In chloroplast transgenic plants for future utlTization of the system for 
biophannaceutical production in plants. 

Chloroplast expression of Human Serum Albumin (HSA): We have already initiated transfomnatlon 
of the tobacco chloroplast genome for hyperexpression of HSA. The HSA codon compositioh is ideal 
for chloroplast expression and no changes in the nucleotide sequence were necessary (see page 37 
of the proposal). We designed several vectors to optimize HSA repression using different 5* regulatory 
reglonSw AO these contam ATG as the first amino add of the mahire protarn. The first vector (pLD-RBS- 
HSA) includes the chloroplast prefenred Riboscwne Binding Site (RBS) sequence GGAGG. In the 
second vector (pLD-5^bA4-ISA) HSA was doned downstream of the psbA 5' UTR including the 
promoter and untranslatBd regicvi, which has been shown to enhance translation. The third vector 
(pLQ-OrftOrfZ-HSA) introduced the putative chaperonin (Orf2) of the B.t cry2Aa2 operon upstream of 
the HSA gene, which has been shown to fold foreign proteins and fbmn crystals, aiding in protein 
stability and purification. 

All chloroplast vectors were txvnbarded Into tot>acco teaves via particle bombardment and after 
4 weeks shoots appeared as a result of independerrt transformation evente. All shoots were tested by 
PGR to verify Integration into the chloroplast genome using ^e method descnbed on page 39 of the 
proposal The positive dones were passed through a second round of selectton to achieve 
homoplasmy and transfened to pols. The phenotype of these planto was completely normal. 
Transgenic leaves analyzed by western blots showed consistently the same pattern of expresston 
depending on the 5' region used in the transtonnation vector (see Rgure 1). Maximum levels of 
expression were obsen^ed in the plants transformed with the HSA preceded by the psbA 5* UTR and 
promoter. Molecular characterfeation of the Urst generation in progress. Southern blots of several 
dones showed homopiamy In aB transgenic lines accept one (see clone # 6, Figure 2). Northern blots 
showed different length of transcripts depending on the 5* regulatory region that was inserted upstream 
of the HSA gene (see Figure 3). The n>ost abundant transcr^ was the monodstron in plants with the 
ffpsbA promoter upstream of ttte HSA gene. Polydstrons of different length were otiserved based on 
the number of promoters used to each oonstmct and differential processing. 
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bserved different levete of HSA in ELISA depending on the extraction iDuffer used 
and furttier optimizatfcH) of this procedure in progress. With incomplete e)ctraction procedures, the 
highest HSA level of expression in plants transfonmed with pLE>5*psbA-HSA was up to 11.1% of total 
$olut)le proteir^ thte is more than 100 fold the expression observed with other two constructs (see 
Rgure 4). Because we have routinely obsen/ed high levete of foreign gene expression with other two 
vectors, we anticipate that the actual level of HSA repression in pLIV5*psl3A-HSA may mceed 50% 
of total soluble protein. Since the expression of HSA under the 5'psbA control is light dependent, the 
time of the tissue harvest for expression studies is important Such changes In HSA accumulation are 
currently being investigated u^'ng EUSA and Northerns. 

Characterization of HSA fifom transgeruc chloropiasts for proper folding, disulfide bond 
fonnatlon and fUnctlonaltty Is In progress. The stromal pH within chloropiasts and the presence of both 
thioredoxni and disulfide isomerase systems provide opfimai conditions for proper folding and disidfide 
bond formatkm within fblded HSA 

Chloroplast expression of Insulin Like Growth Factor (IGF-I): From previous studies (see page 30, 
table 1) we obsen^ that IGF-I gene coding sequence is not suitatrie for high levels of expression In 
chloropiasts. Therefore, we have detenmiiied the optimal chloroplast sequence and employed a 
recursivB PCR method (see page 37) for total gene synthesis (see Figure 5). The newly synthesized 
gene was cloned into a PCR 2.1 vector. Insertion of zz-tev sequence upstream of IGF1 coding 
sequence (see pages 41-42) for facifrtating subsequent purification is in progress. 

To demonstrate expression* purification and proper cleavage of ttie fusbn protein we also 
cloned the full length IGF-I Ondudlr^ the pre-sequence) In an alphavirus vector and expressed the 
proteh in human cultured cells. Alphavirus system has been used because ft expresses adequate 
amounts cf protein to induce a very good immune rasponse In test animals. We obeen/ed that the 
protein had the predicted size, ts property cleaved in cells to produce ttie mature protein and is 
exported Into the growth medium. This secreted protein could be immunoprectpitated using antHGF^ 
antibody* The zZ"4ev»IGF-l was also cloned in an alphavirus vector, repressed and lat>eled In human 
cultured cells. This has aBowed us to see that the proteni had the predided size and as expected, is 
not secreted. To cleave zz tag after purification from chloropiasts, TEV protease is necessary (see 
page 42). TTierefore, we have expressed and purified TEV protease In bacteria. After purification we 
couid obtain approximately 0.5 mg. This TEV protease deaved the ybeled zz-tev-IGF-l produdng two 
fragments, zz-tev and mature IGH. We are cunrently labeOng more fusion protein to optimize 
conditions for TEV deavaga 

Chloroplast expression of Interferon a5 (IFN-a5): As proposed, we have doned human IFNoS. 
fused with a Histkiine tag (for helping in further purmcatfon, see page 42) and introduced the gene into 
the chloroplast transfonnation vector (pLD). Western blots demonstrated expression of tt» IFIMoS 
protein in E coh using pLD vectors, and the maximum level was observed with the 5'psbA UTR and 
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gene was cloned into the pLD using lx>th sequences and t)ombarded into tobacco 
leaves. Shoots appeared after 5 weeks and the second round of selection is in progressL 

All proposed experiments done so far have yielded results as expected. With successful hyper- 
expression of HSA observed in chloroplast transgenic plants (SOCMbId higher than previous reports oT 
nudear transgenic plants in the literature), we are optimistic that the transgenic chlonq[)lasts will 
emerge as a biopharmaceutlcai production system bi the near future. NIH funding to support of this 
proposal would make this a reality. 

• Expression of HSA via tlie chloroplast genome in tobacco. 




Flgnre 1: Western Blot of tobacco protein extracts. A) 1: 40 ng pure HSA; 2x molecular weig^ raaiker; 3»4,6: 
uixtzansformed plant extracts; 5: extract from plants transformed \wth: PLD- 5*UIR-HSA; 7: jiAJ^OrflOrC-fisA. B) 
1: 40 ng pure HSA; 2: mdecular weight marker; 3,5: untransfonned plant extracts; 4: extract thorn plants transformed 
widi: FLD- RBfrHSA; 6: pLI>OflOrC24iSA. 10 micrograms of piant protein were loaded in eadi weO. 
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L A stable plastid transformation and expression vector competent for stably 
tiansfonning a plastid genome vAnoh comprises an expression cassette comprising as operably 
linked components, in the 5* to the 3' direction of translation, a promote operative in said 
plastid, a selectable maricer sequence^ a heterologous DNA sequ^ice coding for a biopolymer- 
promsulm fusion gene, a transcription tenntnatioii r^on functional in said plastid, and flanking, 
each side of the expression cassette, flankmg DNA sequences which are homologous to a DNA 
sequence inclusive of a q>ac^ sequ^ice of the taiggt plastid ^om^ wbmby stable integratbn 
of the hetmlogous coding sequence into the plastid g^iome of the target plant is facilitated 
tfaiouf^ioiit homologous reeombinaticm of tbe fiankiiig sequence witii tiie homotogous seqiraoes 
in the taig^ plastkl genome. 

2. A stable plastid transfcurmation and expression vector competent for stably 
transforming a plastid genome which comprises .an expression cassette comprising as operably 
linked components^ in tiie 5' to die 3' direction of translation, a promoter operative in said 
plastid, a selectable marker sequence, a heterologous DNA sequence coding for a diolera toxin 
B-subunit-proinsulin fusion gene, a transoription termination region functional in said plastid, 
and flanking, each ^de of the expression cassette^ flanking DNA sequ^ces whidi are 
homolo^us to a DNA sequence tnchisive of a spacer sequence of the target plastid genome, 
whereby stable integration of the heterologous coding sequence into the plastid genome of the 
taiget plant is jSicilitated ttuoiighout homologous recombination of the flanking sequence with 

. the homologous sequences in the target plastid gencnne. 

3. A stable plastid transformation and eqitession vector ccHnpetent for stably 
transfcarmtng a plastid genome vAnch comprises an expression cassette comprising as operably 
linked components, in the 5' to die 3' direction of translation, a promoter operative in said 
plastid, a selectable maxter sequence, a heterologous DNA sequence coding for a plastid DNA 
fragment comprismg a S'UTR sequ^oce positioned upstream of the promoter to enhance 
translation of proinsulin protein, a transmption tmunation r^on fimc^ 

flankmg, each ^de of the e3q>ression cassette, flanking DNA secpiences which are homologous to 
aDNA scqdcncc inclusive of a spacer sequence of the targ^ plastkl genome, whereby stable 
integration of the heterologous coding sequence mto tiie plastid g^iome of the target plant is 
&ciiitatBd througliout homologous recombination of the flanking sequrace with the homologous 
sequences in the tar^ plastkl genome. 

4. A stable plastid transformation and expression vector competent fcH* stably 
transforming a plastid genome which comprises an expression cassette comprising as operably 
liidced components, in die 5* to die 3* dkection of translation, a promoter operative in said 
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table marker sequence^ a heterologous DNA sequence further coding for a plastid 
DNA fragment comprising a S^UTR sequence positioned upi^ream of the promoter and Ae 
selectable maricer sequence to fuitii^ enhance translation of its promsulin fHrotein, a transcription 
termination region functional in said plastid, and flanking, each side of the expression cassette, 
flanking DNA sequences which are hcunologpus to a DNA sequence inclusive of a spacer 
sequence of the target plasdd genome^ whmby sCable integration of the heterologous coding 
sequence mto the plastid gpnome of die target plant is fecilitated throughout homologous 
rec(»nbination of the flanlring sequence widi the homolo^)us sequences in the target plastid 
genome. 

5. A stable plastid transSKmnation and ex(»ession vectcx- competent finr stably 
transforming a plastid genome ^ich com|mses an expression cassette comprismg as operably 
linked components^ in tiie 5' to the 3' direction of translation, a jMromoter operative in said 
plastid, a selectable marker sequence> a heterologous DNA sequence coding for a Cry2aA2 
openxi which comprises two op^ reading frames (ORFl and ORP2X wh^in the ORF 
immediately upstream of the CTy2aA2 codes for a putative chaperonin^ which assist the 
ciystall^aticMi of die insulin and aid in subsequent purification , and which operon is fiised 
directfy upstream of the promoter fudon i»roteiny a transcription termination region fimctiooal in 
said plastid» and flankmgi cxii side of the ezpresskn cassette, flankmg DNA sequoices which 
are homologous to a DNA sequence inclusive of a spacer sequence of die target plastid genome» 
vriiereby stable i ntegration of the hetmlogous coding sequence into the plastid genome of die 
target plant is facilitated throughout homologous recombination of die flanking sequence with 
die homologous sequences in the target plastid graiome. 

6. A stable plastid transformation and expression vector competent for stably 
transforming a plasdd genome v^ich comprises an ^ression cassette comprising as operably 
linked components^ in the 5* to die 3* direction of translation, a promoter operative m said 
f^astid, a selectable marker sequence^ a heterologous DNA sequence further codmg finr a cholera 
toxin B-5ubunit-piastid modified proinsulm (PtPris) fiision wh^n its nucleotide sequence 
modified sudi diat the codons are optimized for plastid expression* ^ile its amino acid 
sequence rnnains identical to iiatm ImiEian prouisu^ 

fimctional in said piastid» and flankmg, each side of the cxpresskm cassette* flankmg DNA 
sequences which are homologous to a DNA sequence inclusive of a spacer sequence of the target 
plastid genome, whereby stable mtegration of the heterologous coding sequence into the plastid 
genome of the target plant is feciiitated throughout homologous recombination of the flanking 
sequence with the homologous sequences in the target plastid genome. 
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A stable plastid transfimnation and aqjiession vector competent for stably 
txansfibnnmg a plastid graome wfaidb comprises an expres^on cassette comprising as operably 
linked components, in &e 5' to die 3' direction of transiation, a promoter q)eradve in said 
plastid^ a selectable maiicer sequence^ a beterologous DNA sequence further coding for cholera 
toxin B-subunit-mini-proinsulin (Mpris) fusion wherein its codc»is are optimized for ptasdd 
expression^ while its amino acid sequence remains id^cal to native human proinsulm, a 
tianscriptiOT tmnination region functional in said plastid, and flanking, each side of the 
expression cassette^ flanking DNA sequences which are homologous to a DNA sequence 
inclusive of a spBc&r secpmce of die target plastid gpnome, vAiereby stable integration of the 
heterologous coding sequence into the plastid genome of &e taiget plant is &d]itated duoiigbout 
homologous recombination of the flanking sequence with the bcnnologous sequ^ices in the 
target jriastid genome. 

8 . A vector of claim 1 , wherein the biopolymer is a AOmer to enable hyper- 
eiqjression of the insulin and to accompli^ rapid one stop purification of the fusion protein. 

9. A stable plasdd transfonnation and expression vector competent for stably 
transforming a plastid genome ^ch comprises an expression cassette comprismg as operably 
linked compon»nts» intiie 5' to the 3' direction of translation, a promoter operative m said 
plastid, a selectable marker sequence, a heterolo^us DNA sequence further coding for synthetic 
protein-base polymer (PBP) fiised to a biolc^cally active molecule, a transcription temiination 
region fonctional in said plastid, and flanking, each side of the e}q)ression cassette^, flanking 
DNA sequences which are homologous to a DNA sequence inchiave of a spacer sequence of tiie 
target plasdd genome^ wherdiy stable mtegration of the heterologous codii^ sequence into the 
plastid genome of the target plant is fiidlitated throughout homologous recombiniaticHi of the 
flanking sequrace wi^ the homologous sequ^ices in the target plastid genome. 

10. A vector of claim 9 wherein the PBP has repeating pentamo* sequraces 
(GVGVP)„, vAa^in * V* is an mteger of I to 250, "G^ is glycme, "V** is valine, and "F* is 
proline. 

1 L A vector of claim 9,* wberem the biok)£^ly active molecule is promsulin, 
uisalm»orHSA. 

12. The vector of claims 1-1 1> wUch comprises flanking each side of the expression 
cassette, flanking DNA sequences whidi are honx>lQgous to a DNA sequence inclusive of a 
spacer sequence of the target plastid genome, which sequence is conserved in the plastkl g^me 
of dififerent plant species, whereby stable mtegration of die het^logous coding sequence into 
die plastid gjsnome of the taiget plant is &cilitaled through hcMnologous recombination of the 
flanking sequences with the homolo9>i^ seau^ices m the target ptasdd g^me. 
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A stable transformed plant which comprises plastid stably transfonned wHfa fbe 
vector of claims 1-12, or the progeny or the seed thereof 

14. A process for stably transforming a higher target plant q)ecies ^Ich compises 
introducing into the plastid gmome of the plant a vector of claims 1-12. 

15. A transformed and edible tobacco or al^fa plant of daim 13. 

16. A trans&xmed plant of claim 15 whidi is edible by humans. 

17. A stable plastid transfinmatjon and expression vector competent for stably 
transfisming a plastid genome ^rfiich oonqarises an ^qsres^on cassette comprisii^; as operably 
linked oomponentSy in tfie 5* to the 3' direction of trandation, a promoter operative in said 
plastidy a selectable marisr sequence^ a heterologous DNA sequence coding for an intofeion 
gene^ a transcription termination region functional in said plai^'d, and Hanking, each side of the 
ejqnression cassette, flanking DNA sequences which are homologous to a DNA sequaice 
mchisive of a ^sacer sequaice of ^ target plastid genome, wherel^ stable integration of the 
h^m)]ogous coding sequence into ttie plastid gmome of the target plant is facilitated dirougjiout 
hcmiologous recombination of the flanking sequence with the homologous sequences in the 
target plastid g^iome. 

18. A stable plastid transformation and e3q)ression vector competent for stably 
tcansfotming a plastid genome whidi comprises an repression cassette comprising as operably 
linked components^ in the 5' to the 3' direction of translaticm, a promoter operative in said 
(dasttd, a selectable marker sequence, a hetardogous DNA sequence coding for a insulin-like 
growfli &ctor gene, a transcription terminaticxi region functional in said plastid, and flanking, 
eadh side of tiie e>qiression cassette, flankmg DNA sequences which are homologous to a DNA 
sequence inclusive of a spacer sequence of the target plastid genome; whofeby stable integration 
of the heterologous coding sequence into the plastid genome of the targ^ plant is fecilitated 
throughout htnnologous recombination of tiie flanking sequence with the homologpus sequences 
m tiie taig^ plastid genome. 

19. A stable plastid transfmnation and expression vector competent for stably 
transforming a plastid genwne comprises an expres»on cassette comprisii^ as operably 
linked compon^its, in the 5' to the 3' direction of translation, a promote operative in said 
plastid, a selectable marker sequence, ahderologous DNA sequence coding fata human sorum 
albumin (HSA) gene» a transduction termination region fiinctbnal in said plastid, and flanking; 
eadi side of the expresskm cassette^ flankmg DNA sequences which are homologous to a DNA 
sequence inclusive of a spacer sequence of the target plastid genome, whereby stable integration 
of the h^erotogous coding sequence mto the pkistid g^me of the targ^ plant is focilitated 
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aologous reCombbation of tiie flanking sequence mlii the homologous sequences 
in the target plasdd genome. 

20. A stable vector of claim 15 wherein EFN-ocS is fiised to a 5' UTR sequence 
positioned upstream of the promoter to enhance translation of the IFN-oc5. 

21. A stable vector of claim 18 wherein IGF- 1 is fused to a 5' UTR sequence 
positioned upstream of promoter to enhance translaticm of the IGF-L 

22. A stable vector of claim 19 wherein HSA is fused to a 5* UTR sequence 
positioned upstream oftfae promoter to enhance tianslation of the HSA. 

23. A stable plastid transfinmation and ex|»essiofi vector competent fiir stably 
transforming a plasdd genome which comprises an expression cassette comprising as opcniAy 
linked components, in the 5' to the 3* direction of translation, a promoter qjiorative m said 
plastid, a selectable mafker sequence^ a heterologous DNA sequence coding for a chokm toxin 
B-subunit; a transcription termination region funcdona] in said plastid, and flanking, each side of 
the expression cassette, flanking DNA sequences which are homologous to a DNA sequence 
inclusive of a q)acer sequence of the target plastid genome, whereby stable integration of the 
heterologous coding sequence into die plastid genome of the target plant is &cUitated throug^iout 
homologous rec(»nbination of the flankmg sequence with the homologous sequences in the 
target plastid genome. 

24. A transformed and edible plant of claim 23. 

25. A transfoimed and edible plant of claim 23, wherdn the plant is tobacco or 

al&l& 

26. A stable plastid transfimnation and ex(M:essK»i vecfx>r competent Sot stably 
transforming a plastid g^ome vAddk comprises an expression cassette comprising as (^lerably 
linked components; in the 5' to die 3* direction of translation, a promoter operative in said 
plastid, a selectable marker sequence, a heterologous DNA sequence coding for a biopolymer 
&sk)n gene, a transcription termination region fimctional in said plastid, and flanking, each side 
of the expressbn cassette, flanking DNA sequraces vMoh are hranologqus to a DNA sequence 
inclusive of a spacer sequence of the target plastid genome whereby stable integration oftfae 
heteiologous coding sequence into the plastid genc»ne of the target i^anr is fecilitated tfarouj^iout 
homologous recombhiatioD of die flanking sequence with the homologous sequences in the 
target plastid genome. 

27. A im)cess for stably transformmg a high^ target plant species which comprises 
introducing into die plastid gmome of the plant a vector of claims 9-23, and 26. 

28. A process for recovering a biopolym^ by a one st^ retraction and purification 
by using tiie reversible propoty of the biopnivmer. 



wo 01/72959- 



283 



PCTAJS0iy06288 



A stably traosfonned plastid of a targst plant species of claims 1 -12. 

30. A process for recovery of a synthetic proteixi4>ase polymer (PBP) fused with a 
biologically active molecule by a one step extraction and purification by using the reversible 
property of the biopolymer of claim 28. 

31. A transformed plastid of a plant of claim 1-12, or the progeny Acrcof which 
shows the homoplasmic nature of tibie transfoimants. 

32. ' A transformed plastid of a plant of cl^m 1-12, or the progoiy thereof which 
shows the heterq[>lasmic nature of the transfonnants. 

33. A stable plastid transfonnatian and esqMresston vector competent for staUy 
transfivming a plasdd graome which comprises an expression cassette comprising as q^erably 
linked ccHnponents» in ^e S* to the 3' dkection of translation, a promoter operative m said 
plastid, a selectable mailcer sequence^ a heterologous DNA sequence coding for a 
biopharmaceutical-protdn coding g«ie, a transcription termination region functional in said 
plastid, and Qankmg, each side of &e expressi<Hi cassette^ flanking DNA sequoices which are 
homologous to a DNA sequrace inclusive of a spacer sequence of the targ^ plastid ^ome, 
whereby stable int^;ration of the heterologous coding sequence into the plastid gaiome of the 
target plant is &cilitated throughout homologous recombination of tiie flanking sequence mlh 
the homologous sequences in the target plastid genome. 

34. A stable plastid transformatk}n and esqpression vector of claim 33, wherein the 
biopharmaceutical-pnrtdn coding gene codes for insulin 

35. A stable plastid transformation and expression vector of claim 34 wherein insulin 
is natural insulin. 

36. A stable plastid transformation and expression vector competent for stably 
transforming a plastid genome whidi ccan{»i5es an expressicxi cassette comprising as operably 
linked components^ in 4ie 5* to the 3' direction of traiislation, a promoter opemdye in said 
plastid, a selectable marker sequence^ a heterologous DNA sequence coding for an opmn vMich 
comprises a putative cfafqieronin, vdiich assists flie crystallization of a protein and aids in 
subsequent purification , and which operon is fiised directly upstream of the promoter fosion 
[Hotein» a transcription termination region functional in said plastid, and flanking, each ade of 
the expression cassette^ flanking DNA sequences which arehomobgous to aDNA sequence 
mchisive of a spaca sequence of the targ^ plastid genome, wherd>y stable integration of the 
hetnofegous coding sequence into the plastid genome of the target plant is ftcilitated throughout 
homologous recombination of the flanking sequence with the homologous sequences in the 
target plastid genome.- 
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A stable plastid vector.of claim 17 or claim 18» wfaeidn the interferon is alpha 5 
]FN-oc5y or the insulin-like growA factor is IGF-1. 



wo 01/72959 



PCT/USOl/06288 



1/20 



Figure 1: Biopolymer-Proinsulin Fusion Protein lilxpression 

A.Cupper^i,aincd B. CoonuissieR.250 Stained 




24kDa 




PAGE: 15% Glycine large gel 

A. Copper Stained : Gei rinsed in waier for 10 min, stained with 0.3M CuCU for Snug, and rinsed in 

water for Smin, 

B. Coomassie R-250 Stained: The same gel was first rinsed for lOmin in water and then stmned for Ihr, 

and destined overnight. 

A. and B. Lanes. I , Prestained \faricer (BioRad) ; 2, Sonic extract of pSBL-OC-XaPris ; 3, reverse 
orientation of fusion protein of pSBL-OC- XaPris: 4, Sonic extract of pLD-OC-XaPris; 5, inverse 
orientation of pLI>OC- XaPris; 6, Sonic extract of £1 co/f strain XL-l Blue crataining no plasmid . 



C. Western Blot of Biopolymer-Proinsulin Fusion Protein 
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Clavcd 
(9 kDa) 



LLanes. 1, BioRad Prestained Marker; 2»3ug of 
Purified Human Ptoinsulin: 3. 5ug of pSBL-OC- 
XaPris (sonication and purification of biopolymcr 
twice); 4, Negative control, XL-1 Blue Keolt, 5, 
Sonic extract pSBL expressing cells (6M 
Guanidine Hydrochloride Phosphate Buffer, pH 
7.0); 6, ScMuc extract of XL-1 Bhre £ca// withno 
pSBL. 



IL Lanes. I » BioRad Prestained Marker; 2, 
5ug of Purified HmnanProbsulin: 3, Sonic 
extract of pSBL-OC-XaPris expressing cells 
(6M Guanidine Hydrochloride Pho^hate 
Buffer, pH 7.0): 4, Sonic extract of pLD- 
OC-XaPris expressing cells (Gua-HCll; 5, 
Sonic extract of XL-1 Blue Ecoli vritfa no 
plasmid. 
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Figure 2: Confirmation of Chloroplast Inte^alfoif «y PiSSL 
of Polvmer-ProinsuUn Fusion Gene 



na_ IT. JAkk S »LIWX: 



@- — 
g L^kfe- 



@ pUMX •Xarris(6.8 iCb) 



pLO flankuig scqucom 



Confirmation ofaadA integration into the chloroplast genome - Primers: 3P/3M 



1.63 u> 



3 4 5 6 7 8 10 II II 13 14 



A. Lanes. 1, 1 kb marker. 2. clone L19b (L=pLD-OC -XaPris) vectorO; 3. clone L9 (mutant): 4X1: 
5 L8d- 6, LlOa: 7. S30b (S=pSBL-OC -XaPris vector): 8. S20a; 9. S60: 10, S7a; 1 1 . S28; 12, 
S41b: 13, Petit havana (not transgenic):14. Positive control (BADH gene present in chloioplasis 
from transgenic plants already oonfirmed) 

Confirmation of integration of auaW and biopolymef-proinsulin fusion genes into the 
chloroplast genome - Prinwis: 2P/2iyi . 



c.o; 



1 



I 



tJkb-^— 



IJkb- 



5 6 



PCR of pLD clones: Luno-, ! . Ikb marker, 
2, Ll7a; 5. LI 9b: L8± 5. L9; Petit havana 
(not transgenic): pLD vector as poativc 
control 



PCRof pSBL clones: Lanes, K Ikb marken 

S 1 7a: 3, S30i^ 4. S7a; 5, S41 b;6, L9(mutant) 
r. Petit havana (not trans^eniG); 8, pSBL vector 
as positive contrd 
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Figure 3 : CTB Gene Expression and Chloroplast Integration 




Western Blot analysis of CTB 
expression in E.coli (13% PAGE): 

Lane J : Purified bacterial CTB {0.5^g) ; 2 & 4: 
Transformed E.coli culture-24 h and 48 h rcsply. ; 
3 ft S : Untransformed Ecoli cultuie-24haiid48h 

re^ly. 
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1.65 kb 



3 3 




^ > 




3.0 bb 



pLD^LH-CTB (dJ Kb) 




pLD flankiag sequences 



C. PCR confirmution of //£///A gene integration 
into chloroplast genome -3P/3M primers 
123456789 10 11 12 13 14 15 




PCR of clones of IsL roand of scJection : 
Lane 1:1 Kb maricer ; 2 - 12 : Plant total DNA 
from spec/ clones l-l I (Note: Lanes 2 & 6 arc 
mutants); 13: Untransfomicd plant: 14: pLEKLH- 
CTB vector ; 15: No DNA template. 



D. PCR confirmation of integration of 
aadA and CTB gene into chloroplast 
genome - 2P/21VI primers 




PCR ofdones of 2ni round of selection : 
Lane 1 : 1 Kb maricer, 2 - 7 : Plant total DNA fiom 
spec/ clones 1- 6 pNlote: Lane 5 is a mutant); 8: pLD- 
LH-CTB vector; 9: Untransfotmcd plant ; 10 : No 
DNA template. 
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Figure 1 
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Figure 2 
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Figure 4 
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• Expression of bacterial operon in transgenic dMoroplasIs, 



§0.35 
£ 0^ 

I- ai5 

^ 0.1 

0.05 
0 



Cry2Aa2 
C^ne Depression 



Transgenic Leaf Age 



Cry2Aa2 
Oporon ExpiBssion 

591 




Transgenic Leaf Age 




F^re 1: Cry2A protem ocmceiitration detenxuned by ELESA in 
tnuisgiBDie leaves. Note 100»fiild increase in pralein aocuinulatian in 
tile presence c^flie putative cli^)cnnUn, ORF2. 



FWore 2 ±iiniiiiogpld labded dectrpn miooscopy 
ofinature transgenic leaf. G^y2A^ oystals in a 
transgmic cfaloraplast expressing Oe cry2A 
openxL 



• Blxpressioa of a small f22aa) peptide in transgenic chloroplasts. 




Bioassay P. aentginasa 



o 
o 
c 

o 

I 



Transgcnie Untransfonned 

Figure3 . Leaves were in&cted with tO h1 of 8x10^. 

syrittgue. Photos 

were tatei 5 da^ after tnoculattmi. 1-2 ug of 
antimicrobial p^de (AMP) is le^nred to kilj 1000 
bacteria] cells. Local co n ce u Ua b oo at the site of 
iafecdon is cstfmaterf to be 20M00ftg AA£P. 




Transgeruc Wild type Birfferonly 

Figure 4. Total plant protein was mixed with 5p) of mid- 
log phase bactena iiom ovemi^^t oiltnre, incubated for 2 
horns at 25^ at 125ipoi and g^wn in LB brolh ovemi^ 
Based on miniinum inhibitojy concentr^iQn of 1-2 p;g 
AMP/IOOO bacteiiai cells, the esqM^ssion level was 
calculated to be 213-43% of the total sohible protem. 



> Expression of Oligomeric form (disulfide bonded) CTB in transgenic chloroplasts. 




transgenic lino 3 



Tcmg Msure CU 

transgenic lljne 7 




BSA 



GM1 



Figure 5: A) CTB ELISA qnatification is shown as a percentage of tte total soWe plant notein. Total sohible 
plant protem fiom youne, rnahae and old leaves of transgenic lines 3 and 7 was quantified. B) CTB-GMl 
Gandioslde bindmg ELBA assays: Plates coated first witii GMl pngliosides and BSA were plated virtlfa total 
sohible plant protein fiom lines 3 a nd ?, untraiurfonned plant total sdubie protem and purified bacterial CTB. The 
absotbaace orthe GMl gRnglinsidft-CTB antibody complex was measured. 
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• Eroression & assembly of disulfide bonded 
Giiv*s 13 monoclonal antibody , 



fI3tt>— j 



28 ID— 




F^re 6: 12% redudng PAGE. ChemiTiiminesceat 
detecdcm witti rabbit anfdiolera serum (1^ and AP 
labeled mouse anti-rabbit IgG ^2^ antibodies. 
Untransfonned, boiled(l) and unboiled (7); Tfansfinrraed, 
boiled (34$^ andunboiied (4XPQnfied cm bofled 
unboOed GO; Madcer (S). 

■Maricer-free cMoroplast transgmic plants^ 



FigurtT: A, B) reducing gels, lanarkers, 2:TTan5genic 
extract showing expression of li^t (A) and heavy chain 
(B) in dilor(^last5, 3: Untransfonned, 4: Human IgA. Q 
nonnreducing gel. I.Transgwc extract showing assembly. 
2: Untransformei 3: Human IgA. Blots A & C were 
detMted with AP conjugated, ^oat anti^uman kappa 
antiboc^. Blot B was detected with AP conjugated goat 
anti^biiiiian ](gA antibody. 
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TaUe 1: Comparison of Spectinoniycin and Betaine 
ald^i^de as tbe selectable marker &r &e first round of 
sdcctiooL 



• Codon eompositfon and eroression levels. 
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Figure 8: Comparison of betaine aldehyde and 
spSctinomycin selection. A. AT. tabttcum Petit Havana 
control in RMOP medrum containing ^ectinom^rt^ ader 
45 da^ B. Bcmibarded feaf discs seleSed on ^^cBnomydn 
m RMOP tnedmm aSxsr 45 days. C Specthx>nwcin resistant 
clones cultoed again (second loumQ to obtain homoplaany. 
D. Petit Havana ccmtrol in RMOP medium contahuDg 
Betame AMdiyde after 12 days of Cttltma B. Bombarded 
leaf discs selected on B^ame Aldehyde inRMOPmedmm 
ate* 12 days of culture; arrow indicates unborobarded feaf 
disc as c(»itroL that 23 shoots are formed on a disc 
selected on betaine ald^yde against 1-2 shoots per disc on 
spectinomyda. F. Betaine alddiyde resistant clones culhared 
ag^in (secraid fOumQ to obtain homo|dasmy. 



^MiO^able 2 (Left): Black indicates g^aes witti unmodified 
native codon composition and mczr expression levels 
observed m transgjcnic diloioplasts, ranked by AT% in 
ascending order. Red indicates genes to be investigated. 
Kusnadi et aL (1997) suggest that a mmimum of 1% TSP b 
ade^niate for commercial feasability. See section d) for 
detaib of AT content, %psbA optimal codons and % of 
codons &at match the cp tRNA pool TSP: % total soluble 
parotem 
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Biopolviner-Prolnsiilin Fasion Protein Expression 

A. Copper Stained B..Ox>inassieR-250 Stained 



24kDa 




24kDa 




Hg 9yAandBLanes: 1, Prestatned Marker (BioRad); 2, Sonic extracts of pSBL^^ 

ofieaatadoQ of insert in pSBLrOC- XaPris; 4, pLD-OC-XaPlis; 5, leveise arieotaticHi of pLD-OC* XaPris; 

6, R coU XL-1 Blue cells with no phsmid . 

Western Blots of Biopolvmer-Proinsnlin Fusion Protein After Sinde Step pnrfficatfon 
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Yig 10: A £. eott expression and 
deavage Lanes: ], BioRad Piestained 
Marker; % 3ng of Purified Human 
Fnnnsulii^ 3» 5ng of pSBIX)C-XaPri3; 
4, N^ative control, reverse orienlatims; 
5» pSBL esqnessing ceils (6M 
OuanldiQe Hydrocfalortde Pluqifaate 
Buffer, ph7.0X 6, XI^l Blue JBleoIr with 
nop^L. 



Ptomsulin 
(9kDa) 

F% 10: B. £. coli expression and 
deavage Lanes: l,BioRad 
Ptetained Markra; 2, 5ug of 
Purified Human ProinsuHn; 3, 
pSBLrOC-XaPris (6M Guanidine 
ttydrocMoiide PhosplKite BufiTer, 
ph7.0X4, pLD-OG-XaPris;5»XL^ 
1 Bhie£oa// wift no piasmid. 




Fig 10: C Transgenic chloropiast 
expression Lanes: i. Purified^ 
coli protein from pLD-OC-XaPris 
es^ression; 2, negative control 
(Petit Havana); 3-5, Chlon^last 
transgenic lines. Note dimer, 
tetramer and hexamer aggregates of 
polymer-insulin fiision protein 



Confirmation of chloropiast integration and homoplasmvAietCTOplasmY by Sonthera Blot Analysis 



6^kb 
4.47kb 




FlglL Biopolyiner-pniinsulin fusion gjoie 
int^ration into the diloroplast genome confirmed 
by Southern blot analysis. Lanes: I , Petit Havana 
(negative cor^I); 2-5, pLD-OC-XaPris cloi^ T^; 
6-8, pSBLrOCOCaPtis clones % probe^xisitive 
control). Homi^lasmy is seen in most transg^c 
liiKs wh tie a few transgraiic lines show 
hetetoplasniy. 
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• Eroression & assembly of disulfide bondeff 
GuY*s 13 monoclonal antibody . 
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Figure 6i 12% leducii^ PAGE. CSiennluiximesccnt 
detection with lablnt aittircliolera serum (1^ and AP 
labeled mouse anti-rabbit IgG (2P) antibodies. 
UniiansfiHined, boiled(l) and unboUed (2); Transfhrmed, 
boDed(3&S) and unboiled (4XPuiified CTB boiled (6)and 
UDbmled (TX Ikfeite (8). 

♦Marker-free cMoroplast transgenic plants^ 



Figare7: A, B) xeducing gels, hmarkeis, 2:Transgemc 
extract showing eaqnessLon of light (A) and heavy chain 
(B) in cUoroplasts, 3: Untransfoimed, 4: Bmaeai IgA C) 
noiHceducing gpL l.Tronsgemc extract showing assembly^ 
2: Urrtntiwfamiedy 3: Haman IgA Blots A & C were 
detected wi& AP CQIyug^ted ^oat anti-luman lappa, 
ant&ody. Blot B was detected widi AP t^mtft^^nh-A gpat 
anti-faumait IgA antibody. 
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Table 1: Cou^Kiiisoii of ^ectmosiycia and Betaine 
aldyhydo as die selectable marker for the first round of 
selectioa 

» Codon composition and eroression Ievds> 



Open reacfing R^me 


r¥T?a L!f ^ i Lira W L-flfS! :\ ! M 


Ptasiid miniproinsulin 


7 


66 


100 


62 


CTB 


4.1 


66 


47 


34 


Cry2Aopenon 


47 


65 


37 


37 


Plastfd protnsulin 


? 


64 


100 


49 


Antimicrobial pep^de 


21 


63 


35 


35 


Guys light chain 


<1% 


49 


31 


44 


OptimizBd blopotymer 


7 


47 


100 


40 


Guys heavychain 


<1% 


40 


25 


44 


I4uman proinsulln 


7 


38 


26 


44 



Figure 8: Conq)arison of betaine alddxyde and 
spectinomycin seloctiflo. A. N. tabacum Petit Havana 
control in RMOP medium containing qsectinoniycin after 
45 .days. B. Bombarded leaf discs sdected on qpectinoniycin 
in RMOP medium ader 45 daysw C ^>ecttnonwdn resistant 
clones cultured again (second round) to obtain honu^lasmy. 
D. Petit Havana control in RMOP noediam containiixg 
Betaine Aldehyde after 12 days of culture. £. Bombarded 
leaf discs selected on Betaine Aldehyde inRMOP medium 
afler 12 days of cuttur^ anow indirates unbcnobarded leaf 
disc as coEttrQl Note &at 23 shocHs are finmed on a disc 
selected on betaine alc^iy^ lugainst 1-2 shoots per d^ on 
spectinonrydn. F. Betaine ald^^rde resistant cloi^ cultured 
again (second round) to obtain honi^lasniy. 



•^H^Table 2 QLelt): Black indicates genes with unmodified 
native codon conqxxsitiQn and meir expression levds 
observed in transgenic chloroplasts> ranked by AT% in 
ascending cxdss. Red indicates gjenes to be investigated. 
Kusnadi et aL (1997) suggest that a minimnm of 1% ISP is 
adeyn t e fox commercial feasability. See section d) for 
details of AT content, 9£pshA optimal codons and % of 
codons that match tiie op ffiNA pooL TSP: % total soluble 
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BiQPolviner^PrDiiisBHn Fusion Protem Expression 

A. Copp&r Stfiined B. Coomas^e R^250 Stained 



kDa 



24^ 




mg 9,AaiidBIane9:l»PkefltaisedKiaikBr^i^^ 

onentation of inseit in pSBL-OC> XaPris; 4» pLD-OC-XaPris; S, levase onsxtaakm of pLI>OC- XaFd^ 
6, ^. cofi XI^l Blue cdls witti no plasmid . 

WcstBm Blots of Biopolymer-Proinsiilin Fnsion Protein After Single Step purification 



35 



20 
72 



i 2 



3 6 



^- * 



Lyj :> 



Puificd FntiDsattD kite) 



Cbnvcd 



Fig 10: A* E. con eipresston and 
cieayag^ Lanes: I, BioRad Prestained 
Madoe^ 2, of Purified Human 
Proinaidm; 3, 5iig of pSBL-OOXaPria; 

4, Negative control, reverse orientation; 

5, pSBL cAiJ i c aaiug ccUs (6M 
Guanidme HydiodJoride fhospfaate 
Bii£fei; ph7.0); S» XL-1 Blue^cd? wdli 
nopSBL. 




74 



25 kDa 



ProiiiSDfin 

Fig 10: B. £. coli espression and 
deavage lanes: 1» BioRad 
FMained Marker; 2, Sug of 
Pnrified Hiiman Ptoinsulin; 3» 
pSHLpOC-XaPris (6M Guanidme 
Hydrochloride Phosphate Buffer, • 
ph7.0); 4, pLI> OC-XaPri^ 5, XL- 
1 "Blu^BxoU wi& no plasmid. 





















if, - 





































Fig 10: C. IVansgenic ehloroplast 
ezpresriott Lanes: I, Purified A 
coh protein fiiom pLD-OCOCiBPris 
eqnession; 2, negative eontrol 
(Petit Havana); 3-5, Chloroplast 
transgenic lines. Note dimer, 
tetiaiuer and hssQzncr aggr^^stcs of 



Confirmation of diloroplagt integration and homoplasmv/hetCToplasmv bv Sonftmi Hlot Analysis 



6.67 ld>- 
4.47U>- 




Figll. Biopotymer-proiisuHn fusion gcuc 
integratum into tiie ehloroplast genome confirmed 
by Southern blot analysis. Lanes: 1, Petit Havana 
(negative ccntroQ; Z-S, pLD-OOXaPzis clones Tq; 

pSBL-OC-XaPds denies 9, probe(positive 
control). Homoplasmy is seen in most transgenic 
lines while a &w transgenic lines show 
heterophunr/'. 
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• Expression of bacteial operon in transgenic diloroplasts. 



S 0.3 

|S 0.16 
# 0.1 

0.05 
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Transgenic Leaf Age 



Cry2Aa2 
Operon Expression 
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CoolnJ young raatdro oM 

Transgenic Leaf Age 




FIgnre 1: O72A protBUi concoitratiGO de tennine d fay BUSA in 
tzansgjBDic leavesL Note 100»fold increase in protein accunmlation in 
^ss ptcseoce of tfiepnlative chat)guiitn» 0RF2. 



Flgnre 2 'Jmnuoogold labeled electron xmcaroscopy 
of mature transgeoic le8£ Ciy2Aa2 crystals in a 
transgenic diloroplast eapiewang ue CKy2A 
opeton. 



> Expression of a small f 22aal peptide tn transgenic diloroplastSL 




BioassayP» aeruginosa 



Transgenic IMtransfonned 

Flgare 3 .Leaves were infected with 10 fil of 8x10^, 
SxlO^ySxlO' and SxlO^celb of P. sytiF^ae. Photos 
were taken 5 afier inoculatioo. 1-2 ug of 
antimipobial peptide f AMP) is xecpnred to ]^ 
bacterial cella. Local ^^^ nrfi tTBt^ ro i at the site of 
infection is estimatBd to be 200-S00|ig AM?. 




Tranegenie Wild type Buffer only 

Flgnre 4. Total jdant protein was mixed with Spl of mid- 
log phase bactena fiom ovezni^ culture, i n c u bated fixr 2 
hwirs at 25^C at 125i|nn and grown in LB broth ovct^iL 
Based on minimum inhibitory concentration of 1-2 pg 
AMP/1000 bacterial cells, the csqnesrion level was 
calcnlatBd tobe 21.5^% of Oe total sotoble protein. 



' Expression of OKgomeric form fdisoBMe bonded^ CTB in transgenic cMorDplasts> 




transgenic Dne 3 



ou 

transgeiriclbie 7 



A 


i.4r 


B 


1^ 


S 


1 


0 




R 


0.8 


B 


o.e 


A 


0.4 


N 




C 




E 





0 Trans.Une7 

□ PufifiBGlCTB 

■ Untransfbmied 
plant 



BSA 



S 



GM1 



Flgnre 5: A) CTB ELISA qoatifieaiion is diown as a percenta^ of the total soluble plant protein. T otal s oluble 
nmt protein fiom young, mature and old leaves of tiansgemc lines 3 and 7 was qu^ostified. B) CTB-GMl 
Gangnoside binding SUSA assays: Plates coated first with GMl gangliosides and BSA were plated with total 
soluUe plant protein firom lines 3 a nd 7, nntransfhnned fdant total soluble protein and purified bacterial CTB* The 
absozbaice or die GMl gangUoside^TTB antibody conqpler was measured. 
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We^MaikBr 

5CF 

PsbA5'UTK- SjntiKsis 
200bp 

200bp MW 
Maikcr 

OK]ln>l(iio 

gure l(above): domngof ItiePsliA S^iistransl^^ 
icm (5'UTR) fiom &c dilaiplast gmcme 
gUro3 (below): a compaarisisoa of DNA scqacnccs of native 
man proinsalin (tcp) andplasddinodfified piomsolm (bottom) 

fcgt^ ftao c a a c ;cwjt^Lat.gcggct cac a c ctggtggaagct.ctctacctagt.gt.gcggg 

II liltilltl i (I II U Hill II lilili I lit lilt II 11 

cgtaaaocaacacttatsftggttetoacctagtagaagotttataektagtatgtggt 




a cgag gcst'tcttc 

III 11 llilillllll II II 



I II II II II II II I II II II 

icgtgaagctgaagatttacaagtaggt 



TT 

Figure 2 (above): SOEtng of 5'UTR to fba 
CTB- human proinsulin sequence 5CP is the 
PSbA 5'UTR and the Cliolcra Toxm B subimit 
(CTB) human proinsulin ftasLon 

Figure 4 (bdow): Recofsive PCR to syntikesizB 
the dilcsoplast modified jntitiunilin (Plpris) 

PTPris280bp 



tgcaggcagcctgcagcccttggccctggagggg 

II II I II II II iiniiii II I II II II II I II II 

agtBsaaCtaggtggtggtcctggtgctggttcrtttacaacctttagctttBgaaggt 
cctgcagaagcgtggcattgtggaacaatgctgtaccagcatctgctccctctaccag 

I II II Hill mil iiiiiiii mil ii ii ii i ii 

Waoa aaa aog tggtattgt a g aii c w i at sittgtaobtotatttgbtottbabaooaa 



II iiimii iiiii 



Native Human Ptoinsulin 

Chloioplast Modified Proinsalin 

MW 



2000 — I 
1650 

1000 





f%1ir e 6 (above): PCR proAicts to con6nn construct 
integratian into the chlcroplast genome using two pzimeis» 3P 
and 3M. 3P anneals to the native diloioplast genome and 3M 
amieak to the intioduoed specdnomycin resistance gene;^ iutdA, 
creating a 1600 bp product only in transgenic clones 

F^ure 5 QeSty. SOEingof ^S'UIR, CnB»andplastid 
modified promsa]Jn»i(Mdi results inte fhsion of aU 
sequences denoted as 5CPTP. Ihe sec(md Isne show tins 
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* Erorcsrion of bacterial oneron in transgenic chloroplasts. 
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Transgenic Leaf Age 




figure 1: Qy2A protem cozKxotraticm detemuned by BUSA in 
tiansgmc Icsvesw >fote 100-fold incxease in pratdn accmnnlation in 
fliepngicnoeof thepiitolivechflpcniniii^ QRFX 



Figure 2 rlnnumogDld labeled electnm microscopy 
of matme tian^geoic leaf Q72Aa2 oystak in a 
tiansgenic chloroplast expiessing me ay2A 
openm. 



• Eipresdon of a small f22aa^ nmtide in transgenic chloronlasts. 



\ aerugmosa 



■■ ■■ l --^ W 



Transgenic Untransfinmied 

Figare3 . Leaves were infected with 10 pi of 8x10^, 
8x10^, 8x10^ and SxlO^ cells of i^j>7intg08; Photos 
were taken 5 davs afier inocoIatiGn. 1-2 of 
a n ti miccobial peptide f AMP) is reyir^ 
bacterial cells. Local concentration at the site of 
infedum is esttmated to be 200-tt)0pg AMP. 




Transgonlc WildtipQ Buffer onlfy 

Figure 4. Total plant piotdn was mixed with 5[il ofwSdr 
log phase bactena fiomovenu^ cnltur^ iocntSaled for 2 
horn at 25^ at 125rpm and grown in LB bro^ ovetnighL 
Based on minimnm inhibitory conceatration of 1-2 pg 
AMP/1000 bacterial cells, the expression level was 
calcnlated to be 21SA3% of fte total sohable protenx 



* Erores^n of Oligomeric fom f djsnlfide bonded) CTB in transgenic dJoroplasts, 




Vonng Mntm OU 

trensgeidc line 7 



1J2 



1 

0.8 

o.e 

0.4 
0.2 



TraraJine3 

□ 

□ PurifiedCTB 

■ Uitransfonned 
pfaant 



BSA 



B 



GM1 



Flgore 5: A) CTB EUSA qoatlficatlon is shown as a perceota^ of tiie total soluble plant proteia Total soloble 
pl^ protein firom young, mature and old leaves of transgenic lines 3 and 7 was quantified. B) CTB-GMl 
GangWnstde binding EUSA assays: Plates coated first wifii GMl gan^ii^ides and BSA were plated with total 
soloble plant protein fiom lines 3 a nd 7, mitnmsfnrmed plant total sohibfe protein and purged bacterial CTB. The 
absoibanceortfaeCBfl ganglioside-CIB antibody complex was measured. 
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' Expression of CTB oHgomers^ 




* EroressioB & assembly of disulfide bonded 
Gpv^a 13 monoclonal antibody , 




1 2 3 




6: 12% reducing PAGE. ChemihuDuiesceiit 
detecdcm wifli tabbit and-cholera sennn (l^ and AP 
l^)eled mouse anti-rabbit IgG (2?) antiboifies. 
Untransfimned* boiledri) and unboiled (2); Transformed^ 
boiled (3ftS) and miboitel (4^Purified CTB boiled (Qand 
nnboOed (TX Madur 9). 

* HSA Nndear transfonnation of Dotato plants > 



Figiire7: A» B) reducing gels, lanarkers^ 2:Tiansgenic 
extract showing e^iression of light ^A) and heavy chain 
(B) in chlarcplasi8» 3: XJntransfmine4 4: Human IgA. Q 
soa-ceduciiip gpL IJTxaasgffiic extract showing assembly* 
2: lMcans&med» 3: HamaD Blob A & C wcce 
detected with AP comu^ded goat anli-lumBa k^ypa 
antibody. Blot B was detected widi AP cofi|TigptBd goat 
antt^nimatt IgA antibody. 




Retattre 
ficquency ojjo 



Figure 8: Western Blot of transgeoic potato tubers, cv 
D^iree. 30 ^ of tuber pzotein was loaded per lane and 
jnxsbed with antirHSA antibody. 1: wild type; 2:40 ng of 
pure HSA; 3-8xlifcrcnt trai^enic hnes^ showing 
different levels of expiessioo. 



• Expressfam of HSA by ddoroplast 
vectors In JL caiL 





o ao2 004 ao6 0108 ai ai2 oih ai« ais 02 
^HSAr in the total soluble protein 

Figure 9: Frecuency histogram including percentage 
Kennebec and Msff^^e tran^eoic plants esqjressing 
different HSA levds. Results are shown as the 
percent^es of tcan^enic phutfs (vertical axis) that 
express a s|)ecific level of HSA of Ifae total stdufailB 
piuteiu Qmnzontal axis). 

» Codog composittpn and eixpresston leyels> 



Flgura 10; Western Blot of K coU protein 
extracts. 1: SO ngpure HSA; 2: molecular weig}! 
marker, 3: pLD^A (control without RBS); 4: 
PUV SUTR-HS^ 5: pIJ>-RBS-HSA; 6: pLIV 
QRFl-^2-HSA; 7: & coli wi^iout pLD vector. 



Open reading Frame 


%TSP 


%A*-T 


%psbA 


%cptRNA 


CTB 


4 


66 


47 


34 


Ciy2A6peft}n 


47 


65 


37 


37 


Anirmicrobtai pepfide 


21^ 


S3 


35 


35 


HSA 


? 


57 


57 


47 


irUerfeton alpha 


? 


54 


31 


40 


RUBISCOssTP 


? 


50 


32 


42 


Gu/s light chain 


<1% 


49 


31 


44 


IGR 


? 


41 


20 


30 


6u/b heavy chain 


<1% 


40 


25 


44 



Table 1: Unmodified native codon conqiosition and expiessioa 
leveb obsoved in transgenic chlQnn>lasts. See section d) fiir 
details of AT oonteDt, %pshA optimal codons md % of codans 
that inatch the q> tRNA pool TSP: % total sohible protein 
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* Soufljcm blot analysis of HS A transgoiic tobacco plants^ 



"Tl^ 1 



Probe ^ 

Bgin 



UntransGmned 



BcoRI 



4JKb 



EcoRI 



EcoRI 




2:27Kb 



SJZSKb 



EcoRI 



r T t ii. a n«rn > i ned Otfl Qp-HSA 



5.4SKh 



EraRI 



430- .1 



OrftOtfS 



6^Kb 

Figure 2: Southern Blot of HSA transgenic plants. Untransfonned tobacco DNA vs. 
transgenic tobacco DNA digested wtfli BcoRL 1^,4: DNA from plants ttansltoDed 
wifli pLD-RBS-KSA; 5 6,7: pLD-5*psbA-HSA; 8^,10: pLD-Orn-Orf2-HSA Note 
Imioplasiny in aS clones except nimd)^ 




• Nortfaera Mot analysis of HSA transgenic 
tobacco plants, 

12 3^4 



Kpolycistrons 



• ELISA a natysia ftf TIR 4 transgmic tobacco 
plants^ 




2 Kb DsonocistrQii 



FSgare 3: Northern Blot of HSA faantSgnnic 

usii^ HSA probe (L8 ld>). 1: imlrMBfonned to 

RNA 2: RNA from plants txansfonocd widi: pliX- 
imS-HSA; 3: pLD-Orfl-Oi&HSA; 4: jiUXS'p^- 
HSA Note different sizes of traoscnpts and tiie 
presence of BK Maocist r oas in number 4. 




Figure 4: of HSA transgenic plants. A-Ea-2: HSA 

standards; F/1-2: Bbnlq Gfl-Tx uotrai^famied Petit Havana 
proteia extract^ proteins from plants transformed 

ivith pLD-QEflQx£2.HSA; F-QO-4 and D-H^-S: p£i>.RBS- 
HSA; Rest oftlie wells omtain extracts ftomdifEcrentcloaies 
transformed with pLD-5*psbA-HSA 



• IGF4: op ttmfgpii Bft onmce and PCR prodnct after synthesis of the new gme. 



B 



tgtggagaGagg ggct tttatttcaaraagcccacagggtatggetccagc 
agtogga993o9cctcagacaggcatogtgeatgagtgctgcttceggagc 
bgtgatctaaggaggctrggagatgtatitgcgcacccctcaagcctgccaag 

tcagct 



ggtcctgaaactttatgtggtgctgaattagtagatgctttacaattcgta 
tgtggtgatcgtggtttxrtatttcaacaaacctactggttacggttcttct 
tctcgtcgtgctectcaaactggtattgtagatgaatgttgtttccgttct 
tgtgatttacgtcgtttagaaatgtactgtgctcctttaaaacctgctaaa 
tcfcgct 




Fignre 5: A) IGF-I native sequence coding 
for the mature jnntdn. B) IGF-I optimiaed 
sentience according to chioroplast preferred 
codon usage. Note changes in red. C) IGF4 
synthetic geoe afieriecursivePCR. 
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•TPwprew ton qf a small 



ffesrei, Leaves we infected w& 10 )il of 8x10^, 
830*,8xlCPairi&deficeUsoCK9T»WBe: Photos ^ 
were taken 5 days afier fnocnhtian 1-2 iig of 
lOtiaiGn^ pcpSde £AMF) B lemnied Id kOi 
baoteiiai cdk Local oxnoeoliaboa at flie site of 
infioctioah BstimatedtobeaOO-800|igAMP. 




Rg^X Total plaatprotm was mixed ^wi^ of mB- 
pbase bactena fiom owBcni^gldt ciiitsfty aKuooted fis 2 
liomsat25^at 125rpniaxKigrowninl3bn>ai ovomig^ 
Bmed oa ffi^""*™*" inlubatDiy cuue aifa a ii oa of 1-2 |ig 
AMPAOOO baetenal oeOs^ Ibe mpreyalnn level was 
o8leidatBdto1ie21^3% ofilietDlaisoliiltelHDtBiiL 



>Tg ^reMiim off OlIgMncaic farm fdisnlfide bonded^ CTB in teUMgeak ddoroptots. 





^^.^.^ lisnsQQnIc Bno 7 

K^xre 3: A) CIB KIIS^ qmifletlte is shown as a pereeot^ of &b total sobi^ v^axt proteioL T o=td[ s otoble 
T%faTTt nroAefai finm youas* XDatun and old leaves of t]"»"^ e*"^ ^nes 3 and 7 was apaxmfied, B) CTB-6M1 
btodSBS ELEA bsm^ Kate .ooBted fiist wil£ Oifl gan^Sodd® «nd B^ were plated w^ 



ixtodtes r.USA BSBsyaz JWa.wM tnst wim umi ea n snt T w n g s «!» xkwv pmwu ^«ui luuu 
adbd>3a pbnt pnitBsi fiosn fizies 3 md7» udraDsfimned plazd total fic»iible ptotem an ponfied oactBoa] CIB. The 
dawfaarogortbeOMlgna^nsid^Cra 



► Enwessioa of CTB i^igomcm 



• KxprM^ioB & assembly of dlsn lfiilft hnndcd 
Gbv*s 13 monodonal antibody . 

ABC 

1234 1234 123 



- 55 



- 33 




PlgnreS: A* B) xedncoos ff>^ lannricCTS^ 2;llniii8gBiiic 
attiadt sixfwmg eajueaaioQ of Bght ^) and bevy ofa^ 
3. U k A i tt iiaCu f i o cd, 4i I 



^ 4{ 12% tedDcn^ PACE, rhwnn i iitit uescent 
dcSrtfam widi anti-dH^Bia senin (1<) and AP 

labeled mouse antMabfaat IgO <^ antibodies^ 
lAdrana&Hmdd, boitedri) andonboOsd @; Trazisfbi 
bQi]ad(3ft5) aodanb^ed (4);piiifiedC^ 
iiiiboilBii(7);lMarQX 



: Human I^A. C) 



ooKodocii^ gd. LTmo^eniD eidi&ct dxiwicg assiuibly* 
2: UuliaJ»B miB<L 3: Human IgA. Blots A & C were 
detected with AP CQ ^pgged ga t an tHiuinam j appa 
Botibody. Blot B was oetocted irah AP ooDjMgated gMtf 
BOt^^hunan feA antibody. 
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98gne <fe WesSem Bka of tcmsg^amo potato tobersgcv 
Dotneu 30 of tite protm WB5 loaaed per 1^ 
probed «i&£atti^ESAaidEbody. InvDdto^ 2:40iigcf 
pure B3A; S-Ssfifoefll taa^gBmc fines, i2»vni^ 
ulUBRJUt iBveis 01 GoaMressMD. 



> Erpreaaiom ofHSA bv chliMroDlast 



1 2 3 4 9 6 7 



o cica ao4 ouos «Loa ax cu3 om axtf ais ctjt 



^jaSAr telte total soinUepxvtein 

yj^Die 7: Ereoofim^ faistogiam 

Kamebec and Dtaite ' 

diSereot BSA tevdL 

c9C|jf6ss a spociCb level 
piotfiia QMHizonial axis}. 




^Banircaslon ofHSAvfa etdoroplast genome in tobacco. 



2 3 



75- 
3»- 



FignraSs Western Stait^R, co^l {HTOteiticadrBctsL 
1: 30 ng jMne EESA; 2: motecobr vre^ markei; 
3: pM>HSA (ocmiRil wi&md SBS); 4: FLD- 
SOmL-BSA: 5: pUVBB&^iaA: 6: jAJV 








Heam 9: We^em Boi of tobocoo fffotein 
«]S»ois. h40ngpiyeHSA;2:infJpinilarwa^ 
marker; 3 and 4r type plant extracts; 5i 
extracts fbon plaids transfo nnc d yn&i PLl^ 
S'OTMSA; 6: pLD-HBS-HSA; 7: pLD- 
0BF1+2-BSA. 50 sBOOgcains of plaofc pnstrak 
Twere loaded per wbH 



♦PCR apalvsfa of tnmaformairts to detcrmiBe integnttiop of HSA gene ftatp tfie ehteroriast genome. 




3BS 



Bsiua 10: A) Mf^) <£ As pL0 dilon^ktefit 
t miiffform nriop vectDr aDdjuBoar Jan£i^ flies. 
Awose gi^ cmtaTntng VCR. products usiiffi total 
pmt DMA as tenndatB fium i^Dts transJomied 

BSA; 8^,10: pUXXm^2-mAi U.]2»13: pU>- 



